








The Theory of Determinants, 
Matrices, and Invariants 



BLACKIE & SON LIMITED
50 Old Bailey, London
17 Stanhope Street, Glasgow

BLACKIE & SON (INDIA) LIMITED
Warwick House, Fort Street, Bombay

BLACKIE & SON (CANADA) LIMITED
1118 Bay Street, Toronto



The Theory of 
Determinants, Matrices
and Invariants 


BY 

H. W. TURNBULL, M.A. 


Regius Professor of Mathematics in the United College, University of 
St. Andrews. Late Scholar of Trinity College, Cambridge 
and Fereday Fellow of St. John’s College, Oxford. 


BLACKIE & SON LIMITED 

LONDON AND GLASGOW 



First issue 1928.

Second impression 1929.


Printed in Great Britain by Blackie & Son, Ltd., Glasgow



PREFACE 


This book has grown out of a short series of lectures which 
were given in August, 1926, at the St. Andrews Congress of the 
Edinburgh Mathematical Society. It was the aim of those lec- 
tures to present in outline the salient features of the Invariant 
Theory, from its origins in the early forties of last century to 
the present day. But in the course of filling in the sketch, it was 
borne in upon me, more and more clearly, as the argument pro- 
ceeded, that the subject takes its rise far earlier. 

For this reason I have followed the method of Salmon in 
opening with an account of determinants. This also made it 
desirable to introduce the rudiments of another great depart- 
ment of algebra — the theory of matrices. These will chiefly be 
found in the first seven chapters, which have been written
mainly with a view to their applications in what follows. It is 
no exaggeration to say that the well-known theorem, given by 
Laplace, for the development of a determinant, plays an essential 
part in all the main theorems of the symbolic invariant theory, 
as here adopted, with the one striking exception of the Basis 
theorem of Hilbert. 

I am glad to acknowledge the great debt which mathema- 
ticians owe to Sir Thomas Muir for his charming History of 
Determinants which is at once a monument and an inspiration. 
If the present book encourages the reader to turn to the History 
and explore its farther fields, one of my objects will be attained. 
Here the subject is confined to what is called determinants in 
general and compound determinants. Perhaps the reader will 
also be tempted to dip into the buoyant papers of Sylvester
(Collected Works) and the systematic treatise by Cullis (Matrices
and Determinoids), who so generously displays the significance
of the earlier writings by Sylvester.





As to the invariant theory itself, an attractive approach to 
binary and ternary forms has for many years been accessible 
through the admirable treatises by Elliott (The Algebra of
Quantics, Oxford, second edition, 1908) and by Grace and Young
(The Algebra of Invariants, Cambridge, 1902), the former
developing the direct, and the latter the symbolic methods. But
during the present century considerable advances have been 
made in studying quaternary and higher forms (involving four 
or more homogeneous variables), both in the algebra itself and 
in its application to physics through the concept of Relativity. 
Accordingly, while I have incorporated just enough of the binary 
theory to give a short connected exposition of its develop- 
ments, my chief concern has been with the general forms. 

Whatever completeness may attach to the present argument 
is finally due to the memoirs and recent books by Weitzenböck¹
and Study.² To the former belongs the credit of extending the
work of Clebsch and Gordan from the binary to the general 
case. But perhaps the most remarkable service which he has 
hitherto rendered is to give a complete account of the basis 
of analytical projective geometry in relation to all the usual
metrical forms, Euclidean and others. An exposition of these 
results is given in Chapter XXI. 

In such a far-flung theory, with all its great ramifications into 
pure algebra, the theory of groups, projective and differential 
geometry, somewhere or other the line must be drawn: and this 
has been done as follows. First, beyond a bare introduction to 
each (Chapters XX and XXI), the two chief applications, to 
algebraic and differential geometry, have been omitted. How- 
ever logically appropriate fuller treatment would have been, 
it was felt that justice could not be done to what is an extra- 
ordinarily attractive and penetrating type of analytical geometry 
in three-fold and higher space, at the end of a long algebraic 
theory. But the reader can find a full account of the plane 
geometry in the later chapters by Grace and Young. 

¹ Invariantentheorie (Groningen, 1923).

² Einleitung in die Theorie der Invarianten linearer Transformationen auf
Grund der Vektorenrechnung (Braunschweig, 1923).

Secondly, there is no mention of the interesting algebra of
alternate numbers for which a × b = −b × a. These have a
long historical record in the work of Grassmann, Whitehead,
Scott and Matthews, and others. The omission calls for some
explanation, because in the deft hands of Dr. Weitzenböck, a
key to the general invariant theory is provided by complex
symbols, which are a type of alternate numbers. But it was
found that by enlisting the full implications of Sylvester’s
Theorem (1851) (p. 48), the ordinary symbols provide quite a
natural medium for the whole general theory, from beginning
to end.

There is also no attempt to grapple with all the details in the 
theory of canonical forms and invariant factors; but the neces- 
sary suggestions for further reading have been made at suitable 
stages. Neither has room been found for the discussion of special 
complete systems; nor for the extensive theory of modular 
invariants which have lately received great attention in America. 

Here and there, illustrative examples have been included, 
often as straightforward applications but occasionally as more 
advanced problems and suggestions for further inquiry and re- 
search. Among examples of determinants and matrices are 
several for which I am indebted to Professor E. T. Whittaker 
and Dr. A. C. Aitken.

My best thanks are due to my colleague, Dr. W. Saddler, for
his ripe judgment and criticism in reading the work, and for 
offering many valuable suggestions; and to Dr. J. Williamson for 
reading the proof-sheets and giving further helpful advice; and 
also to Dr. J. Dougall for his expert and very efficient help in 
removing both mathematical and typographical blemishes. 

H. W. TURNBULL. 

St. Andrews, June, 1928.




CONTENTS


Chapter I

MATRICES AND DETERMINANTS

1. Notation
2. Definition of Matrix
3. The Transposed Matrix
4. System of Linear Equations
5. Linear Combinations of Rows or Columns. Number Field. Rank
6. Linear Equations which are not Homogeneous
7. Condition of Solubility


Chapter II

FUNDAMENTAL PROPERTIES OF THE DETERMINANT

1. Derangements
2. The C+ and C− Classes
3. Definition of Determinant
4. Arrangement of Terms in the Expansion of a Determinant. Co-factors
5. Laplace’s Development of a Determinant
6. Algebraic Complements and Minors of Order r
7. Determinantal Permutation


Chapter III

LINEAR PROPERTIES. FUNDAMENTAL
LAPLACE IDENTITIES

1. Linearity. Homogeneity
2. Special Determinants
3. Double Suffix Notation and other Contractions
4. A Determinant is Irresoluble into Factors
5. Rules for Combining Matrices
6. Currency of a Matrix
7. Transposition Properties of Determinants
8. Fundamental Laplace Identities
9. Fundamental Identities of Order n
10. Implicit and Explicit Convolution
11. General Fundamental Identities of Order n
12. Linear Relation between n + 1 Linear Forms
13. Principle of Duality


Chapter IV

MULTIPLICATION OF MATRICES AND DETERMINANTS

1. Fundamental Laws of Algebra
2. The Law of Multiplication of Matrices
3. Product of Square Matrices of Order n
4. Double Suffix Notation of Multiplication
5. The Division Law
6. Products of Determinants
7. Reciprocal and Adjugate Determinants
8. The Index Law and the Reversal Law of a Matrix
9. Summary of Laws of Matrices


Chapter V

LINEAR EQUATIONS. THE THEOREM OF CORRESPONDING
MATRICES. FURTHER THEOREMS

1. Matrices and Linear Equations. Rank
2. Application to Linear Equations
3. The Upper Suffix Notation
4. The Theorem of Corresponding Matrices
5. Inner Product of Two Rectangular Matrices
6. Laplace Developments of the Inner Products
7. Rank of the Product of Matrices
8. The Simplex
9. Extended Form of Cauchy’s Theorem, commonly called Sylvester’s Theorem on Compound Determinants
10. The Generalized Ratio Theorem
11. Tensor Constants of the Fundamental Identities
12. Application of the Principle of Duality
13. The Sylvester Identity
14. Formal Proof of the Sylvester Identity


Chapter VI

SPECIAL TYPES OF DETERMINANT

1. Properties of Matrices and Determinants connected with the Leading Diagonal
2. The Cayley-Hamilton Theorem
3. Special Types of Determinant
4. Reciprocation of Bordered Determinants
5. Bordered Adjugate Determinant
6. Symmetrical Matrices and Determinants
7. Skew Symmetric Determinants
8. Characteristic Function of a Skew Matrix
9. Summary of Theorems on Compound Determinants


Chapter VII

DIFFERENTIATION OF A DETERMINANT

1. The Polarizing Process
2. The Capelli Operators
3. The Cayley Operator
4. Theorem of Corresponding Matrices adapted to the Capelli Operator
5. Connexion between Substitutional Analysis and Differentiation
6. Jacobians
7. Rank of Jacobian Matrix


Chapter VIII

BINARY FORMS

1. Binary Invariants
2. Orthogonal Transformation and Invariants
3. Development of the Invariant Theory
4. The Binary Form or Quantic
5. Gradient. Degree and Weight
6. The Induced Linear Transformation of the Binary n-ic
7. Polar Forms
8. Formal Definition of Invariant
9. Simultaneous Invariants
10. The Aronhold Operator
11. Multilinear Invariants
12. Covariants
13. Relation between Linear Forms and Covariants


Chapter IX

THE GENERAL LINEAR TRANSFORMATION

1. Cogredience and Contragredience
2. Linear Transformations in Matrix Notation
3. Orthogonal Transformations and Matrices
4. Cayley’s Determination of the Orthogonal Matrix whose Determinant is Positive
5. Linear Transformation with Absolute Quadric
6. Group of the Orthogonal Matrix
7. Dimensions of the Transformation Group
8. Induced Compound Transformations
9. Connexion between Matrices and Quaternions


Chapter X

GENERAL PROPERTIES OF INVARIANTS

1. Linear Transformation of the General Form of Order p
2. Projective Invariants
3. Homogeneity of Invariants
4. Ground Forms
5. Symbolic Notation
6. Symbols for Forms in Three or More Variables
7. Polar Forms
8. Equivalent Symbols


Chapter XI

THE FIRST FUNDAMENTAL THEOREM

1. Symbolic Factors. Inner and Outer Products
2. Effect of Linear Transformation on the Symbols
3. Converse Theorem
4. The Valency Condition pq = nw + m for Single Ground Form
5. First Fundamental Theorem for a System of Linear Forms
6. Invariants of One or More General Ground Forms
7. Examples of Invariants. Interchange of Equivalent Symbols
8. Double Convolution of Symbols referring to a Quadric Form
9. Solution of Symbolic Linear Equations


Chapter XII

MULTILINEAR FORMS

1. Multilinear Forms
2. Symbolic Representation of Multilinear Forms
3. Classification of Multilinear Forms
4. Cogredient and Contragredient Symbols
5. Equivalent Symbols
6. Effect of Linear Transformation on the Symbols
7. Fundamental Theorem for the General Multilinear Form
8. Covariants, Contravariants, and Mixed Concomitants
9. Convolution and Resolution
10. The Fundamental Theorem for the General Case
11. Proof of the Fundamental Theorem


Chapter XIII

SYMBOLIC METHODS OF REDUCTION

1. The Fundamental Identities
2. The Second Fundamental Theorem
3. Binary Quadratic Forms. Reducibility
4. Significance of the Complete System
5. Canonical Form of Two Binary Quadratics
6. Extension to Forms of Higher Order
7. Transvectants
8. Reducibility of Jacobians
9. Remarks on the Proof of the Second Fundamental Theorem


Chapter XIV

SEMINVARIANTS. ALGEBRAICALLY COMPLETE SYSTEMS

1. Seminvariants and Leading Term of a Concomitant
2. Seminvariants as Solutions of Partial Differential Equations
3. Algebraically Complete Systems. Syzygies
4. Irreducibility. Gordan’s Theorem


Chapter XV

THE GORDAN-HILBERT FINITENESS THEOREM

1. Hilbert’s Basis Theorem
2. Proof of Gordan’s Theorem
3. Limit to the Number of Syzygies
4. Multiple Fields
5. Combinants
6. Further Examples of Complete Systems. The Binary Cubic
7. The Binary Quartic Form
8. References to Complete Systems


Chapter XVI

CLEBSCH’S THEOREM

1. Introduction of Clebsch’s Theorem
2. Compound Polars. Standard Forms
3. Reduction to Standard Form
4. The Gordan-Capelli Series
5. Examples of the Series for Binary and Ternary Fields
6. Normal Forms
7. Historical Note


Chapter XVII

APPLICATIONS OF CLEBSCH’S THEOREM. APOLARITY
AND CANONICAL FORMS

1. Similar Forms
2. Types
3. Peano’s Theorem
4. Dual Similar Forms
5. Apolarity
6. Apolarity of Dissimilar Forms
7. Canonical Forms
8. Counting Constants is not Sufficient
9. Proof of the Lasker-Wakeford Theorem


Chapter XVIII

INVARIANT EQUATIONS AND GRAM’S THEOREM

1. Expression of a Gradient by Coefficients of Covariants
2. Invariant Equations
3. Gram’s Theorem
4. Grace’s Theorem
5. Invariants as Elimination Results
6. The Equivalence Problem
7. Extension of Stroh’s Lemma


Chapter XIX

GEOMETRICAL INTERPRETATIONS OF ALGEBRAIC
FORMS

1. Homogeneity and Correspondence
2. Principle of Duality
3. Further Binary Results
4. Connexion of Binary with Higher Fields
5. The Clebsch Transference Principle. Extensionals
6. Projective Properties
7. First Geometrical Interpretation of Linear Transformation. Collineation
8. Latent Points of a Transformation
9. Second Geometrical Interpretation of Linear Transformation. Change of Frame of Reference
10. Reciprocation and Correlation
11. Correlation
12. Canonical Form of a Matrix


Chapter XX

THE GENERAL QUADRIC

1. Complete System of the General Quadric
2. Self-conjugate Simplex
3. Canonical Form of the Quadric
4. Theory of Two Quadrics
5. Reduction of Two Quadrics to Sums of Squares
6. Complete System of (n + 1) Invariants
7. Complete Systems involving Variables


Chapter XXI

MISCELLANEOUS RECENT DEVELOPMENTS

1. Restricted Transformations
2. Preparatory Reductions leading to the Proof of the Fundamental Theorem
3. Characteristic Invariant Property
4. Proof of the First Fundamental Theorem
5. Consequences of the Theorem
6. The Orthogonal Group
7. Fundamental Theorem of Orthogonal Transformation
8. First Fundamental Theorem for Proper Orthogonal Invariants
9. The Hermitian Transformation with an Absolute Quadric
10. Geometrical Significance of the Adjunction Theorem
11. Remarks on the Adjunction Theorem
12. Connexion between Differential and Projective Invariants
13. Prepared Systems
14. Quantitative Substitutional Analysis

Index



The Theory of Determinants, 
Matrices, and Invariants


CHAPTER I 

Matrices and Determinants 


1. Notation.

The fundamental importance of determinants as working tools 
in mathematics has come to be so widely recognized that it may 
be assumed that the reader has some practical knowledge of them 
and in particular that he has realized their value in providing a 
simple general rule for the solution of linear equations. Certain 
introductory results may therefore be given without undue em- 
phasis on intermediate steps, which can easily be supplied. 
Our first object is to learn a notation and a few important 
definitions. 

Suppose there are two homogeneous linear equations in three
variables x, y, z,

$$a_1 x + b_1 y + c_1 z = 0, \qquad a_2 x + b_2 y + c_2 z = 0. \tag{1}$$

Then in general they have a solution

$$\frac{x}{b_1 c_2 - b_2 c_1} = \frac{y}{c_1 a_2 - c_2 a_1} = \frac{z}{a_1 b_2 - a_2 b_1}. \tag{2}$$

These denominators, which are called determinants of the second
order, can be written shortly in various ways, all of which have
great value:

$$\begin{aligned}
&\text{(i)} && |b_1 c_2|, && |c_1 a_2|, && |a_1 b_2|, \\
&\text{(ii)} && (bc)_{12}, && (ca)_{12}, && (ab)_{12}, \\
&\text{(iii)} && (bc), && (ca), && (ab).
\end{aligned} \tag{3}$$


The last of these ways makes use of the obvious fact that if two
letters bc are written down side by side, one is first and the other
is second, read from left to right. We agree to drop the suffixes
in (iii), whenever they are 1, 2, for exactly the reason that we
drop the index 1 in writing $x^p$ when p = 1. In fact we define
$(bc)_{ij}$ to mean $b_i c_j - b_j c_i$ and merely suppress the suffixes ij in the
case when i = 1, j = 2.
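
The bracket notation and the solution (2) can be checked numerically. The following is a minimal Python sketch, not part of the original text; the function names `minor` and `solve_two_homogeneous` and the sample coefficients are illustrative assumptions.

```python
def minor(u, v, i, j):
    """Return (uv)_ij = u_i v_j - u_j v_i, with suffixes counted from 1 as in the text."""
    return u[i - 1] * v[j - 1] - u[j - 1] * v[i - 1]

def solve_two_homogeneous(a, b, c):
    """Return one solution (x, y, z) proportional to ((bc), (ca), (ab))."""
    return (minor(b, c, 1, 2), minor(c, a, 1, 2), minor(a, b, 1, 2))

# Hypothetical coefficients: a = (a_1, a_2), and likewise for b and c.
a = (1, 2)
b = (3, -1)
c = (2, 5)

x, y, z = solve_two_homogeneous(a, b, c)

# Substitute back into both equations of (1); each residual should be zero.
residuals = [a[k] * x + b[k] * y + c[k] * z for k in range(2)]
print((x, y, z), residuals)
```

Any non-zero multiple of the triple returned here satisfies the equations equally well, which is why (2) fixes only the ratios x : y : z.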

A fourth and more familiar notation for the determinant
$b_1 c_2 - b_2 c_1$ is the well-known square array, introduced by Cayley¹
in 1841 long after determinants (and much that will concern us
in this chapter) were first invented. It is

$$\begin{vmatrix} b_1 & c_1 \\ b_2 & c_2 \end{vmatrix} \tag{4}$$

which has the advantage of showing such coefficients of the original
equations, as appear in the first determinant, exactly in their same
relative positions.

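This leads to still more ways, all useful, of writing down the
solution of equations (1):

$$\begin{aligned}
&\text{(i)} && \frac{x}{\begin{vmatrix} b_1 & c_1 \\ b_2 & c_2 \end{vmatrix}}
 = \frac{y}{\begin{vmatrix} c_1 & a_1 \\ c_2 & a_2 \end{vmatrix}}
 = \frac{z}{\begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix}}, \\
&\text{(ii)} && x : y : z = (bc) : (ca) : (ab), \\
&\text{(iii)} && x,\ y,\ z \propto \begin{Vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \end{Vmatrix}.
\end{aligned} \tag{5}$$

In each of these cases three equations have been grouped into one
statement. Only in (iii) we note that an essentially new idea is
present: the double vertical lines,² before and after the rectangular
array, signify that determinants are to be chosen therefrom by
suppressing in turn the first, second, and third column of letters,
and at the same time retaining the orders b, c; c, a; a, b of the
columns.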

¹ In 1841, Collected Works, 1, 1.

² This notation has sometimes also been used to denote the matrix of the
array.


2. Definition of Matrix.

There is obvious importance in adopting a methodical arrangement
of equations and all such polynomial expressions, involving
several variables x, y, z. Also, because of the convenient fact that
many of the properties of a square or oblong formation can be
illustrated by arranging four or six things two by two in a square,
or two by three in an oblong, we can continue to extract useful
general notions from our equations (1). The set of coefficients

$$\begin{matrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \end{matrix}$$

of (1), arranged in their relative positions, is an example of a
matrix of orders two and three. A matrix of orders m and n simply
means a set of mn numbers arranged in rectangular array with m
rows and n columns.

At first sight such a definition strikes one as awkward and 
vague, for the question naturally arises in the mind, what shall 
we do to these numbers, shall we add or subtract them or form 
them into determinants? Nevertheless it is exceedingly useful 
to train ourselves to think of an array of numbers as a single 
thing with properties of its own, and to hold ourselves in readiness 
to operate on the terms or elements of the array in any convenient 
way that suggests itself, as in fact we have done in the preceding 
results (2), (3), (5). We are indeed all familiar with this idea, for 
ordinary Cartesian co-ordinates

x, y, z

of a point in space provide a simple instance. Here the matrix
is of orders one and three. This involves more than merely three
numbers x, y, z: it is three numbers together with a specific
relation between them; namely, that they are ordinally arranged.
In general when x, y, z differ, the geometrical interpretation of
the different arrangements

[x, y, z], [x, z, y], [y, x, z], [y, z, x], [z, x, y], [z, y, x]

is six different points: and this is hint enough that regarded as
algebraic elements (molecules, if we like), we may with advantage
study the behaviour of matrices, always treating them as single
integral things, and not as elaborate clusters of component parts.
Just as Cayley first provided us with the well-known square
notation for determinants (cf. (4)) so also we have to thank him
for first¹ enunciating this principle. He, however, confined the
definition of a matrix to a square formation only.

Let us agree to use brackets [ ] for enclosing the constituents
of a matrix, and incidentally for expressing co-ordinates of a
point, in plane or space, so that we can now proceed to discuss
the matrix M of the coefficients of linear equations (1), and write it

$$M = \begin{bmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \end{bmatrix}.$$

We can also with advantage notice that there is a matrix X of
the homogeneous variables x, y, z in the equations, namely

X = [x, y, z].

It is a simple but far-reaching fact that for a given system of
equations, arranged by columns and rows as in (1), there are these
two matrices M and X. One cannot exist without the other.

It was said that in general equations (1) have a solution. By
this is meant all cases in which the two equations are effectively
distinct, a state of things that only breaks down if

$$a_1 : b_1 : c_1 = a_2 : b_2 : c_2.$$

When this happens the coefficients of one equation are proportional
to those of the other, and the two equations furnish no
more information about x, y, z than either of them alone would do.
It is then impossible to derive solutions (2) from (1), still less
the results (5). If we define the phrase determinants of the matrix
M to mean all the determinants (bc), (ca), (ab), we may state that
the equations (1) are soluble unless all the determinants of the
coefficient matrix M are zero.

Suppose two of the determinants (bc), (ca) vanish. Then,
eliminating $c_1$, $c_2$ it follows that (ab) also vanishes. Hence a
sufficient condition, for the insolubility of equations (1) in the form
(2), is that two of the three determinants of M vanish.

If, however, only one determinant vanishes, (bc) say, then
x = 0, y : z = (ca) : (ab). And we may define equations (2) to
have this meaning, although standing alone the ratio x : (bc)
would now be meaningless and could not be employed.

Just as there is geometrical significance in [0, 0, 0] which
denotes the set of co-ordinates of the origin, so we may presume
that the null matrix

$$\begin{bmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}$$

has algebraic significance, although in relation to equations (1)
it appears to indicate their non-existence.

¹ Phil. Trans. (1858); Collected Works, 2, 475.

We may sum up this little investigation by attaching a special
term rank to a matrix. The rank of

$$\begin{bmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \end{bmatrix}$$

is two, unless all the determinants $b_1 c_2 - b_2 c_1$, $c_1 a_2 - c_2 a_1$,
$a_1 b_2 - a_2 b_1$ vanish, in which case it is one, unless again all six
elements $a_1$, $b_1$, $c_1$, $a_2$, $b_2$, $c_2$ vanish, in which case it is zero.
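
The three-way rule for the rank of a two-by-three matrix translates directly into a short test. The Python sketch below is an illustration only; the function name `rank_2x3` and the sample rows are assumptions, not taken from the text.

```python
def rank_2x3(row1, row2):
    """Rank of [[a1, b1, c1], [a2, b2, c2]] by the rule stated above."""
    a1, b1, c1 = row1
    a2, b2, c2 = row2
    # The three determinants (bc), (ca), (ab) of the matrix.
    determinants = (b1 * c2 - b2 * c1, c1 * a2 - c2 * a1, a1 * b2 - a2 * b1)
    if any(d != 0 for d in determinants):
        return 2
    # All determinants vanish: rank one unless every element is zero.
    if any(e != 0 for e in tuple(row1) + tuple(row2)):
        return 1
    return 0

print(rank_2x3((1, 2, 3), (4, 5, 6)))   # two effectively distinct equations
print(rank_2x3((1, 2, 3), (2, 4, 6)))   # proportional rows
print(rank_2x3((0, 0, 0), (0, 0, 0)))   # the null matrix
```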


EXAMPLES 

1. If $a_1 x + b_1 y + c_1 z = 0$, $a_2 x + b_2 y + c_2 z = 0$ are the Cartesian
equations of two distinct planes, prove the rank of the coefficient matrix
is two. If the rank is one, what is known about the planes?

2. If these equations refer to lines in a plane, in areal (or other
homogeneous) co-ordinates, what is the significance of the rank of their matrix?

3. A two-by-three matrix has rank unity. Show that if its rows denote
areal co-ordinates of a point, each row denotes the same point.

What other two-by-three matrix has rank unity?

Ans. A matrix in which one row is three zeros.

4. A three-by-two matrix has three rows and two columns. If each
row is interpreted as Cartesian co-ordinates of a point in a plane, show
that its three determinants all vanish if the three points are in line with
the origin.


3. The Transposed Matrix. 

If we interchange columns and rows without disturbing the
order of either, reading columns downwards and rows from left
to right, we obtain the transposed¹ matrix. Let us use an accent
to denote the transposed matrix. Thus the transposed of M is

$$M' = \begin{bmatrix} a_1 & a_2 \\ b_1 & b_2 \\ c_1 & c_2 \end{bmatrix}.$$

If we transpose M' we obtain M, so that here we have an example
of a conjugate or symmetrical relation between two things. So
also the transposed of X = [x, y, z] is

$$X' = \begin{bmatrix} x \\ y \\ z \end{bmatrix}$$

and vice versa. The determinants of the matrix M' are (bc), (ca),
(ab) which are the same as those of M. More precisely the relation

$$\begin{vmatrix} b_1 & b_2 \\ c_1 & c_2 \end{vmatrix} = \begin{vmatrix} b_1 & c_1 \\ b_2 & c_2 \end{vmatrix}$$

shows the identity of corresponding transposed determinants.
But we should naturally think of the determinants of M as forming
a row (i.e. a matrix) of three elements, of the same pattern as X,

[(bc), (ca), (ab)]

while those of M' form a column of three.

Owing to the practice of writing from left to right, rather than,
as the Chinese do, from top to bottom, we have never accustomed
ourselves to thinking of co-ordinates of a point written downwards
as in X'. It will later appear that this novel way sometimes has
very great advantages. But occasionally, in order to save space,
a column matrix will be written horizontally and enclosed in
braces { }. Thus

X' = {x, y, z}.

¹ Sometimes called conjugate matrix.
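
As a small numerical illustration of transposition, the Python sketch below (an assumption-laden example, not part of the original text; `transpose`, `det2` and the sample matrix are hypothetical) checks that transposing twice restores M and that the determinant (bc) is the same whether taken from M or from M'.

```python
def transpose(matrix):
    """Interchange rows and columns without disturbing the order of either."""
    return [list(column) for column in zip(*matrix)]

def det2(m):
    """Determinant of a two-by-two array given as a list of two rows."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

# Hypothetical coefficient matrix M with rows (a_1, b_1, c_1) and (a_2, b_2, c_2).
M = [[1, 3, 2],
     [2, -1, 5]]
M_t = transpose(M)

# (bc) read from M: the columns b and c of M.
bc_from_M = det2([[M[0][1], M[0][2]], [M[1][1], M[1][2]]])
# (bc) read from M': the rows b and c of M'.
bc_from_M_t = det2([M_t[1], M_t[2]])

print(M_t, bc_from_M, bc_from_M_t, transpose(M_t) == M)
```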

4. System of Linear Equations. 

Before dealing with the general case involving n variables, 
let us consider a set, or system, of three linear equations homo- 
geneous in four variables 

$$\begin{aligned}
a_1 x + b_1 y + c_1 z + d_1 t &= 0, \\
a_2 x + b_2 y + c_2 z + d_2 t &= 0, \\
a_3 x + b_3 y + c_3 z + d_3 t &= 0.
\end{aligned} \tag{6}$$

Multiplying these respectively by $(bc)_{23}$, $(bc)_{31}$, $(bc)_{12}$ and adding
we find that all terms involving y or z disappear, and that the
result may be written

$$(abc)\,x + (dbc)\,t = 0, \tag{7}$$

where (abc) is a convenient symbol for the coefficient of x in the
result, and (dbc) for that of t. Thus

$$\begin{aligned}
(abc) &= a_1 (bc)_{23} + a_2 (bc)_{31} + a_3 (bc)_{12} \\
&= a_1 b_2 c_3 - a_1 b_3 c_2 + a_2 b_3 c_1 - a_2 b_1 c_3 + a_3 b_1 c_2 - a_3 b_2 c_1.
\end{aligned} \tag{8}$$

Likewise

$$(dbc) = d_1 b_2 c_3 - d_1 b_3 c_2 + d_2 b_3 c_1 - d_2 b_1 c_3 + d_3 b_1 c_2 - d_3 b_2 c_1.$$

Manifestly the series for (abc) may also be written

$$a_1 b_2 c_3 - a_1 c_2 b_3 + b_1 c_2 a_3 - b_1 a_2 c_3 + c_1 a_2 b_3 - c_1 b_2 a_3, \tag{9}$$

and further, it is clear, on expansion, that the following equalities
are true

$$(abc) = (bca) = (cab) = -(acb) = -(bac) = -(cba). \tag{10}$$


Before Cayley introduced the notation

$$\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} \tag{11}$$

for this series (8), which is a determinant of the third order, it was
frequently written

$$\sum \pm\, a_1 b_2 c_3, \tag{12}$$

the summation indicating either that a, b, c are to be deranged in
all six possible ways, as in (9), without deranging the suffix order
1, 2, 3, or vice versa, as in series (8), the suffixes are deranged but
the letters are not. The ± sign here indicates that some terms
have a positive sign and some a negative, the choice depending
on a rule to be presently explained.
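
The sum over derangements can be sketched in a few lines of Python. The sign rule has not yet been stated at this point, so the sketch assumes the usual one (a term is negative when the suffix order contains an odd number of inversions); the names `sign`, `bracket` and the sample numbers are likewise illustrative assumptions.

```python
from itertools import permutations

def sign(suffixes):
    """+1 or -1 according to the parity of the number of inversions (assumed rule)."""
    inversions = sum(
        1
        for i in range(len(suffixes))
        for j in range(i + 1, len(suffixes))
        if suffixes[i] > suffixes[j]
    )
    return -1 if inversions % 2 else 1

def bracket(*letters):
    """(ab...) as the signed sum of products a_i b_j ... over all suffix derangements."""
    n = len(letters)
    total = 0
    for suffixes in permutations(range(n)):
        term = sign(suffixes)
        for letter, s in zip(letters, suffixes):
            term *= letter[s]
        total += term
    return total

# Hypothetical values: a = (a_1, a_2, a_3), and likewise for b and c.
a = (2, 0, 1)
b = (1, 3, -1)
c = (0, 4, 5)

abc = bracket(a, b, c)
print(abc, bracket(b, c, a), bracket(a, c, b))
```

Here the letters keep their order while the suffixes are deranged, as in series (8); interchanging two letters in the call reverses the sign, in agreement with (10).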

If we also solve for y or z, as in (7), we obtain in general

$$\frac{x}{(bcd)} = \frac{-y}{(acd)} = \frac{z}{(abd)} = \frac{-t}{(abc)}, \tag{13}$$

as should be carefully verified. The negative signs occurring
with the alternate variables y, t are inserted to maintain
the alphabetical order in the denominators. For these
determinants are obtained in the Cayley notation by suppressing
in turn each one of the columns of the coefficient matrix

$$\begin{bmatrix} a_1 & b_1 & c_1 & d_1 \\ a_2 & b_2 & c_2 & d_2 \\ a_3 & b_3 & c_3 & d_3 \end{bmatrix}. \tag{14}$$

Some writers define these as the determinants of this matrix:
it is preferable, however, to attach a sign + or − according as
an odd or even column is suppressed. Thus x, y, z, t are respectively
proportional to the determinants A, B, C, D of the coefficient matrix,
namely

$$A = (bcd), \quad B = -(acd), \quad C = (abd), \quad D = -(abc). \tag{15}$$
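
The verification asked for above can be carried out numerically. The following Python sketch is illustrative only: the helper `det3` and the chosen coefficients are assumptions, and the check simply substitutes A, B, C, D for x, y, z, t in the three equations (6).

```python
def det3(u, v, w):
    """(uvw) expanded as in series (8): u_1 (vw)_23 + u_2 (vw)_31 + u_3 (vw)_12."""
    return (u[0] * (v[1] * w[2] - v[2] * w[1])
            + u[1] * (v[2] * w[0] - v[0] * w[2])
            + u[2] * (v[0] * w[1] - v[1] * w[0]))

# Hypothetical columns of the coefficient matrix (14).
a = (1, 0, 2)
b = (2, 1, 0)
c = (0, 1, 1)
d = (1, 1, 1)

# Signed determinants (15): + for an odd suppressed column, - for an even one.
A = det3(b, c, d)
B = -det3(a, c, d)
C = det3(a, b, d)
D = -det3(a, b, c)

# Substitute x, y, z, t = A, B, C, D into each of the three equations (6).
residuals = [a[k] * A + b[k] * B + c[k] * C + d[k] * D for k in range(3)]
print((A, B, C, D), residuals)
```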

5. Linear Combinations of Rows or Columns. Number Field.
Rank.

It is convenient to have a precise notation applicable to 
matrices, determinants, and systems of equations. A few examples 
suffice to explain it. Consider the array 

$$\begin{matrix} a & b & c \\ x & y & z \\ a + x & b + y & c + z \end{matrix} \tag{16}$$

Here we obtain the third row by adding the elements of the
two other rows columnwise. This is denoted by

$$\text{row}_3 = \text{row}_1 + \text{row}_2. \tag{17}$$

Again consider the array of four columns,

$$\begin{matrix} a & b & c & a + b + c \\ x & y & z & x + y + z \end{matrix} \tag{18}$$

Here is an example of adding row-wise. We denote it by

$$\text{col}_1 + \text{col}_2 + \text{col}_3 = \text{col}_4. \tag{19}$$

Next the two arrays

$$\begin{matrix} a & b & c \\ pa & pb & pc \end{matrix} \qquad\qquad \begin{matrix} a & qa \\ b & qb \\ c & qc \end{matrix}$$

exhibit what is meant by multiplying a row or column by a given
number. We write these

$$\text{row}_2 = p\,\text{row}_1, \qquad \text{col}_2 = q\,\text{col}_1$$

respectively.

In general, by

$$p\,\text{row}_1 + q\,\text{row}_2 + r\,\text{row}_3 \tag{20}$$

is meant: form a new row by multiplying the first by p, the second
by q, the third by r, and adding. Similar remarks apply to
columns, but only the former process applies immediately to
equations, as for instance in reaching result (7) of the last
section from (6).

We shall assume that all the symbols a, b, c, x, p, ... hitherto
used, stand for real or complex numbers. It follows that the
process (20) implicitly includes subtraction as well as addition 
of rows, since one or other of p, q, r may be real and negative. 

At this stage it is useful to have a clear conception of what is 
meant by a field of numbers. This can be defined as follows. 

Definition of Number Field. A class of two or more complex
numbers forms a field if whatever two equal or unequal members
p and q are chosen then p + q, p − q, p × q, p ÷ q are themselves
members of the field, excepting the case q = 0 in the quotient
p ÷ q.

It follows that integers do not form a field, because for instance
2 ÷ 3 is excluded: but rational numbers form a field. So also
do real numbers. So also do numbers of the type a + b√5 where
a and b are rational. So also do complex numbers. It also follows
that zero is a member of every possible field, by taking p and q
equal in p − q.
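
The closure property in the definition can be illustrated with exact rational arithmetic. The Python sketch below is an assumption-laden illustration, not part of the original text: it represents a number of the type a + b√d (a, b rational) as the pair (a, b), takes d = 5 as an illustrative choice, and uses the hypothetical helper names `q_mul` and `q_inv`.

```python
from fractions import Fraction

# Rational numbers are closed under division; integers are not.
quotient = Fraction(2) / Fraction(3)
quotient_is_integer = quotient.denominator == 1

def q_mul(u, v, d):
    """Product of a + b*sqrt(d) and c + e*sqrt(d), each stored as a pair of rationals."""
    a, b = u
    c, e = v
    return (a * c + d * b * e, a * e + b * c)

def q_inv(u, d):
    """Reciprocal of a + b*sqrt(d), again of the same type (u must be non-zero)."""
    a, b = u
    norm = a * a - d * b * b  # non-zero when d is not a rational square
    return (a / norm, -b / norm)

d = 5
u = (Fraction(1), Fraction(2))       # the number 1 + 2*sqrt(5)
u_inverse = q_inv(u, d)
product = q_mul(u, u_inverse, d)     # should be (1, 0), i.e. the number 1

print(quotient, quotient_is_integer, u_inverse, product)
```

The reciprocal again has rational parts, which is the point of the definition: division never leads outside the class.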

Suppose we now prescribe a definite field F for the numbers 
p, q, r of (20). By this procedure we are said to form a linear 
combination of the rows or columns in question.

Obviously if p, q, r are all zero we should form a row of zeros from

$$p\,\mathrm{row}_1 + q\,\mathrm{row}_2 + r\,\mathrm{row}_3.$$

But, excluding this case, suppose we still get a row of zeros, i.e. a null row, when not all p, q, r vanish; then we term the several rows so combined linearly related or linearly dependent in the field F. The same definition applies to columns. And if we cannot get a null row unless all p, q, r vanish, the several rows so combined are linearly independent in the field F. Likewise for any number of rows, and columns.

This distinction between linear dependence and independence 
is of the utmost importance, and should be carefully thought out 
with these simple cases, in order to pave the way for its more 
elaborate use at later stages. 

We can now utilize this discussion of linearity to define the 
rank of a matrix in general. The rank of a square matrix is 
the greatest number of its rows or columns which can be found to 
be linearly independent. That of a rectangular matrix with fewer 
rows than columns is the greatest number of its rows which are 
linearly independent. If there are more rows than columns, the 
same test applies to its columns. 

So for the m × n matrix the rank may be any whole number
0, 1, 2, . . . , r not exceeding either m or n. It will be shown in 
Chapter V that this definition amounts to the same thing as that 
already adopted in §2. 
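The definition of rank by linear independence can be tried numerically. The following Python sketch is an illustration only: it assumes the field of rational numbers (exact fractions), and the name `rank` is a hypothetical helper, not notation from the text. It reduces the rows by the combinations of rows described in (20) and counts the rows that do not become null.

```python
from fractions import Fraction


def rank(matrix):
    """Rank over the rational field, found by exact row reduction."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    n_rows = len(rows)
    n_cols = len(rows[0]) if rows else 0
    r = 0  # number of linearly independent rows found so far
    for col in range(n_cols):
        # find a row, at or below position r, with a non-zero entry in this column
        pivot = next((i for i in range(r, n_rows) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, n_rows):
            factor = rows[i][col] / rows[r][col]
            # row_i - factor * row_r: a linear combination of two rows
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r


# row_3 = row_1 + row_2, so only two rows are independent
print(rank([[1, 2, 3], [4, 5, 6], [5, 7, 9]]))
# the unit matrix has three independent rows
print(rank([[1, 0, 0], [0, 1, 0], [0, 0, 1]]))
```

The same function applies to rectangular arrays, since the count of non-null rows left after reduction cannot exceed either m or n.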

6. Linear Equations which are not Homogeneous. 

The solutions of the homogeneous equations in either three or four variables already treated give the ratios but not the exact values of the variables x, y, z, .... In general n such equations for n + 1 unknowns x, y, z, ..., t determine the ratios

$$x : y : z : \ldots : t$$

in terms of the coefficients. It follows that if one of these, the last, t, for example, is given in value, the rest can be determined. It is useful to take $t = -1$, for when this is done we can at once write down solutions as follows. The binary case has two equations for two unknowns x, y: the ternary case has three for three unknowns.


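(1) The Binary Case.

If

$$\left.\begin{aligned} a_1x + b_1y &= c_1 \\ a_2x + b_2y &= c_2 \end{aligned}\right\}, \qquad (ab) \neq 0,$$

then

$$\frac{x}{(cb)} = \frac{y}{(ac)} = \frac{1}{(ab)},$$

so that

$$x = \frac{(cb)}{(ab)}, \qquad y = \frac{(ac)}{(ab)}.$$
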

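(2) The Ternary Case.

If

$$\left.\begin{aligned} a_1x + b_1y + c_1z &= d_1 \\ a_2x + b_2y + c_2z &= d_2 \\ a_3x + b_3y + c_3z &= d_3 \end{aligned}\right\}, \qquad (abc) \neq 0,$$

then

$$\frac{x}{(dbc)} = \frac{y}{(adc)} = \frac{z}{(abd)} = \frac{1}{(abc)},$$

so that

$$x = \frac{(dbc)}{(abc)}, \qquad y = \frac{(adc)}{(abc)}, \qquad z = \frac{(abd)}{(abc)}.$$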


It is worth while noticing the simple manner in which these last fractions, giving x, y, z, are formed. Each denominator is the determinant formed from the left-hand side coefficient matrix of the given system of equations. This matrix is now a square array. The numerator of x is obtained from the same determinant by suppressing the first column and substituting the column of d's in its stead. By substituting in the second and third columns of (abc) similarly, we obtain the respective numerators of y and z. This device overcomes the difficulty (cf. (14)) of affixing the sign. It has the advantage of perfect generality, for it applies equally well to n equations involving determinants of the nth order. This rule was first given by Cramer in 1750.
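The rule just described is easily put to a numerical trial. The sketch below is illustrative only and assumes the ternary case with rational coefficients; the names `det3` and `cramer3` are hypothetical helpers introduced here. Each numerator is formed by substituting the column of d's for one column of (abc).

```python
from fractions import Fraction


def det3(m):
    """Determinant of a 3 x 3 array, expanded by its first row."""
    (a1, b1, c1), (a2, b2, c2), (a3, b3, c3) = m
    return (a1 * (b2 * c3 - b3 * c2)
            - b1 * (a2 * c3 - a3 * c2)
            + c1 * (a2 * b3 - a3 * b2))


def cramer3(coeffs, d):
    """Solve a_i x + b_i y + c_i z = d_i (i = 1, 2, 3) when (abc) is not zero."""
    abc = det3(coeffs)
    if abc == 0:
        raise ValueError("(abc) = 0: the rule does not apply")
    solution = []
    for k in range(3):
        # suppress column k and substitute the column of d's in its stead
        replaced = [[d[i] if j == k else coeffs[i][j] for j in range(3)]
                    for i in range(3)]
        solution.append(Fraction(det3(replaced), abc))
    return solution


# x + y + z = 6, 2x - y + z = 3, x + 2y - z = 2
x, y, z = cramer3([[1, 1, 1], [2, -1, 1], [1, 2, -1]], [6, 3, 2])
print(x, y, z)
```

The function refuses the case (abc) = 0, which is the case of failure discussed in the next section.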

7. Condition of Solubility. 

These equations are soluble, as we see, unless in the binary case (ab) = 0, and in the ternary case (abc) = 0. A similar test holds for more variables. Thus the equations in the ternary case are soluble if the rank of the square matrix

$$\begin{bmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{bmatrix}$$

is three. It is interesting to examine the cases of failure, when 
the rank is less than three; but as our chief concern is with the 
soluble case we leave this aside. 





EXAMPLES 


1 . Write down the determinants of the following arrays : 



c 

c' 

1 



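2. What is the rank of the following matrices?

$$\begin{bmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \\ 7 & 8 & 9 \end{bmatrix}, \quad \begin{bmatrix} 1 & 2 & 3 \\ 0 & 0 & 0 \\ 7 & 8 & 9 \end{bmatrix}, \quad \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}, \quad \begin{bmatrix} 1 & 1 & 0 \\ -1 & -1 & 0 \\ 0 & 0 & 0 \end{bmatrix}.$$

Ans. 2, 2, 3, 1.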


3. Solve, for x : y : z : t, the homogeneous equations:

ax + by + cz + dt = 0,
a^x + h^y -{- cH + dH = 0 . 

What is the rank of their matrix? 

Ans. 3 if at least three of a, b, c, d are different;

1 if they are all equal; otherwise 2. 

4 . The complex number 2 + 3i consists of two linearly independent 
parts 2, 3i in the field of real numbers, but not in the complex field. 

The same is true of every non-zero complex number. 


5. The rational numbers $\frac{p}{q}$ together with all numbers such as $\frac{p + r\sqrt{5}}{q}$ form a field (where p, q, r are integers, and q ≠ 0).

If five points are the vertices of a regular pentagon, prove that the ratios of all segments of all lines joining these in pairs belong to the field.



CHAPTER II 


Fundamental Properties of the Determinant 

1. Derangements. 

In order to consider determinants more generally, and to make the exposition clear, we must now recall several fundamental facts of algebra. First the number of arrangements or permutations¹ of n different things placed in a row or column r at a time is

$${}_nP_r = n(n-1)(n-2)\ldots(n-r+1) = \frac{n!}{(n-r)!}, \qquad (1)$$

where $n! = 1 \times 2 \times 3 \times \ldots \times n$, which gives the number when all are arranged each time. Secondly, the number of combinations, or groups, of r things chosen from n different things is

$${}_nC_r = \frac{n!}{r!\,(n-r)!}. \qquad (2)$$

Thus a group of n things can be divided into subgroups, containing respectively r and n − r things, in ${}_nC_r$ ways, for this is only another way of describing the same process.
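These two counts may be checked by direct computation. The following sketch is an illustration only; the helper names `arrangements` and `groups` are hypothetical, and the results are compared with the standard library functions `math.perm` and `math.comb` (available from Python 3.8).

```python
from math import comb, factorial, perm


def arrangements(n, r):
    """Number of arrangements of n different things taken r at a time, as in (1)."""
    return factorial(n) // factorial(n - r)


def groups(n, r):
    """Number of groups of r things chosen from n different things, as in (2)."""
    return factorial(n) // (factorial(r) * factorial(n - r))


n, r = 7, 3
print(arrangements(n, r), perm(n, r))   # the same count, reached in two ways
print(groups(n, r), comb(n, r))         # likewise for combinations
print(groups(n, r) == groups(n, n - r)) # choosing r things leaves n - r behind
```

The last line reflects the remark that dividing n things into subgroups of r and n − r things is only another description of choosing r of them.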

Now consider the function 

abc . . . m 

formed by the product of n different numbers a, b, c, ..., m.

¹ Jacob Bernoulli (1654-1705) first used this word in this sense: Ars conjectandi (1713). Factorial n was introduced by Kramp (1808). The brilliant achievements of a young French mathematician, Pascal (1623-62), set this theory a-going. He discovered the number ${}_mC_n$ as $(n+1)(n+2)\ldots m/(m-n)!$, and initiated the mathematical theory of probability. Moreover, it is interesting to notice that Pascal's results followed from the study of a matrix

$$\begin{matrix} 1 & 1 & 1 & 1 & \ldots \\ 1 & 2 & 3 & 4 & \ldots \\ 1 & 3 & 6 & 10 & \ldots \\ 1 & 4 & 10 & 20 & \ldots \end{matrix}$$

Cf. Rouse Ball, History of Mathematics (London, 1901), p. 294.



Here is an example where the arrangement of factors is immaterial: all the n! ways of arranging the row a, b, ..., m are equivalent. But consider next

$$\sum a_1 b_2 c_3 \ldots m_n \qquad (3)$$

defined as a function of numbers $a_1, \ldots, a_n, \ldots, m_1, \ldots, m_n$, in which the summation sign indicates n! terms, obtained by permuting a, b, c, ..., m in all ways without disarranging the suffixes. This function is called the permanent of the square matrix of order n

$$\begin{bmatrix} a_1 & b_1 & c_1 & \ldots & m_1 \\ a_2 & b_2 & c_2 & \ldots & m_2 \\ \vdots & & & & \vdots \\ a_n & b_n & c_n & \ldots & m_n \end{bmatrix}. \qquad (4)$$


Such a function is easily constructed in any particular case. It 
is useful to us in paving the way for a better grasp of what a 
determinant is. 
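To see how such a function is constructed in a particular case, the sum (3) can be formed mechanically. The sketch below is illustrative only; `permanent` is a hypothetical name, and the array is assumed to be given as a list of rows, row i holding the elements with suffix i.

```python
from itertools import permutations
from math import prod


def permanent(matrix):
    """Sum of all n! products taking one element from each row and each column."""
    n = len(matrix)
    return sum(
        prod(matrix[row][cols[row]] for row in range(n))
        # the suffixes (rows) stay in order while the letters (columns) are permuted
        for cols in permutations(range(n))
    )


print(permanent([[1, 2], [3, 4]]))                    # 1*4 + 2*3
print(permanent([[1, 1, 1], [1, 1, 1], [1, 1, 1]]))   # one unit term for each of the 3! arrangements
```

Every term enters with a positive sign; it is precisely the attachment of signs to these same n! terms that distinguishes the determinant.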

If p, q, r, ..., t are the n given numbers, a, b, c, ..., m, in another order, so that

$$p_1 q_2 r_3 \ldots t_n$$

is a term of the series (3), we call pqr...t an inversion or derangement¹ of abc...m. We also have a special symbol

$$\begin{pmatrix} a & b & c & \ldots & m \\ p & q & r & \ldots & t \end{pmatrix}$$

to mean the substitution of p, q, r, ..., t respectively for a, b, c, ..., m. It should now be clear that

$$\sum p_1 q_2 r_3 \ldots t_n$$

means exactly the same permanent as $\sum a_1 b_2 c_3 \ldots m_n$, $\sum$ still having the same meaning.

Next, by transposition is meant the interchange of two of the n letters without deranging the other n − 2 letters. If these two letters are adjacent in the row the process is called an adjacent transposition. Thus

abcde, abdce, adbce, dabce, dacbe

represent terms derived by adjacent transposition in succession, whether read from left to right or right to left. Manifestly any two terms of the series (3) can be connected by such a chain involving adjacent transposition, and the process can be carried out in many ways if many letters are involved.

¹ Cramer's word (1750).

Theorem. — Any transposition is equivalent to an odd number 
of adjacent transpositions. 

For if the transposition interchanges p and q between which k letters stand, it is equivalent to 2k + 1 adjacent transpositions caused by shifting p through k + 1 places until it is just past q and then shifting q through k places back to where p first stood.
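The argument of this proof can be followed step by step on a particular row of letters. The sketch below is an illustration only; the function name `transpose_by_adjacent_swaps` is hypothetical, positions are counted from 0, and the arrangement is assumed to be a string of single letters.

```python
def transpose_by_adjacent_swaps(arrangement, i, j):
    """Interchange the letters at positions i and j by adjacent transpositions only.

    Returns the new arrangement and the number of adjacent transpositions used.
    """
    if i == j:
        raise ValueError("a transposition needs two different places")
    if i > j:
        i, j = j, i
    items = list(arrangement)
    count = 0
    # shift p (at position i) through k + 1 places until it is just past q
    for pos in range(i, j):
        items[pos], items[pos + 1] = items[pos + 1], items[pos]
        count += 1
    # q now stands at position j - 1; shift it through k places back to position i
    for pos in range(j - 1, i, -1):
        items[pos], items[pos - 1] = items[pos - 1], items[pos]
        count += 1
    return "".join(items), count


# a and d have k = 2 letters between them, so 2k + 1 = 5 adjacent steps are used
print(transpose_by_adjacent_swaps("abcde", 0, 3))
```

Whatever the value of k, the count returned is 2k + 1, which is always odd.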

2. The $C_+$ and $C_-$ Classes.

Theorem. All the n! arrangements of n letters abc...m may be sorted into two classes $C_+$ and $C_-$, such that an even number of transpositions applied to any arrangement does not alter its class whereas an odd number does so. The class $C_+$ is taken to include the original arrangement abc...m.

In the example just cited the terms are as follows:

$C_+$: abcde, adbce, dacbe,

$C_-$: abdce, dabce.

Proof.

This consists of two parts, first to show the practicability and next the unambiguity of the classification. First, if

$$\begin{pmatrix} 1 & 2 & 3 & \ldots & n \\ i & j & k & \ldots & l \end{pmatrix}$$

denotes a substitution whereby a new arrangement ijk...l is derived from the n integers 1, 2, 3, ..., n, we may prove the possibility for these integers and then apply the same classification to n letters (or anything else capable of orderly arrangement). We count how many in the lower row precede 1, how many greater than 2 precede 2, how many greater than 3 precede 3, and so on. Then ijk...l is placed in $C_+$ or $C_-$ according as the total count is even or odd. As this counting tallies with adjacent transpositions the classification is practicable.
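The counting rule of this first part lends itself to a direct trial. The following sketch is illustrative only; the names `inversions` and `permutation_class` are hypothetical, and the natural order is assumed to be the alphabetical (or numerical) order of the symbols.

```python
def inversions(arrangement):
    """Total count: for each symbol, how many greater symbols precede it."""
    items = list(arrangement)
    return sum(
        1
        for i in range(len(items))
        for j in range(i + 1, len(items))
        if items[i] > items[j]
    )


def permutation_class(arrangement):
    """'C+' when the total count is even, 'C-' when it is odd."""
    return "C+" if inversions(arrangement) % 2 == 0 else "C-"


# the chain of adjacent transpositions cited in the text
for term in ["abcde", "abdce", "adbce", "dabce", "dacbe"]:
    print(term, inversions(term), permutation_class(term))
```

Each step of the chain changes the count by one, so the classes alternate along it, in agreement with the sorting given in the example.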

Next it is unambiguous. This is proved by showing that no two arrangements $T_1 = 123 \ldots n$ and $T_r = ijk \ldots l$ are ever connected both by an odd and by an even chain of adjacent transpositions. Let the supposed chains be given by terms

$$T_1, T_2, \ldots, T_{r-1}, T_r,$$

$$T_1, T'_2, \ldots, T'_{s-1}, T_r,$$

where consecutive terms differ only by adjacent transposition. Form the chain from $T_1$ to $T_1$ by linking these chains at $T_r$, thus:

$$T_1, T_2, \ldots, T_{r-1}, T_r, T'_{s-1}, \ldots, T'_2, T_1.$$

If $T_2$ differ from its predecessor by interchange of p, q, some later term $T_k$ differs from what immediately precedes it by interchange of q, p: otherwise the original order as in $T_1$ could not be finally reached. If several terms have this property, we choose that nearest to $T_2$. Pair off $T_2$, $T_k$ and start again on the first unpaired term after $T_2$, repeating the same argument. In this way all the series except the first $T_1$ is paired off. Hence either both chains are odd or both are even. This proves the theorem.

Example.

$T_1$ = 2314
$T_2$ = 3214
$T_3$ = 3124
$T_4$ = 3142
$T_5$ = 3412
$T_6$ = 3421
$T_7$ = 4321
$T_8$ = 4231
$T_9$ = 2431
$T_{10}$ = 2341
$T_{11}$ = 2314

Here two even chains connect $T_1$ and $T_r$. We pair off, in order, rows 2, 8; 3, 6; 4, 9; 5, 11; 7, 10.


Reciprocity or Duality.

This theorem applies at once to a permanent; nor does it matter whether it is developed by fixing the suffixes and deranging the letters, or fixing the letters and deranging the suffixes. Thus when n = 2

$$a_1b_2 + b_1a_2 = a_1b_2 + a_2b_1.$$





Everything, in fact, that has been said of the two classes $C_+$ and $C_-$ will hold of either suffix or letter permutation. But if we combine both, we obtain rather a different state of things; there would be in all n! × n! terms giving the function φ exactly n! times, each term of φ occurring n! times.

We may find to which class such a term as

$$p_i q_j r_k \ldots t_l$$

belongs by adding the total count among pqr...t to that of the suffixes ijk...l, for this tallies with recovering the order abc...m first and then that of the suffixes 123...n.

3. Definition of Determinant. 

The series of n! different terms

$$\sum \pm\, a_i b_j c_k \ldots m_l$$

wherein suffixes alone are permuted and the sign of the term is given by the class $C_+$ or $C_-$ to which it belongs is the general determinant of order n. It is often written

$$(abc \ldots m), \quad \text{or} \quad |a_1 b_2 c_3 \ldots m_n|,$$

or more expressly

$$\Delta = \begin{vmatrix} a_1 & b_1 & c_1 & \ldots & m_1 \\ a_2 & b_2 & c_2 & \ldots & m_2 \\ \vdots & & & & \vdots \\ a_n & b_n & c_n & \ldots & m_n \end{vmatrix}.$$

The leading diagonal term

$$a_1 b_2 c_3 \ldots m_n$$

has a positive sign (apart of course from special negative values among its n factors).
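The definition may be carried out literally for small orders. The sketch below is an illustration only, with hypothetical names `sign` and `determinant`; it keeps the letters (columns) in order, permutes the suffixes (rows), and attaches to each of the n! terms the sign of its class. It is practicable only for small n, since the number of terms is n!.

```python
from itertools import permutations
from math import prod


def sign(suffixes):
    """+1 for an arrangement of class C+, -1 for class C-, by counting inversions."""
    count = sum(
        1
        for i in range(len(suffixes))
        for j in range(i + 1, len(suffixes))
        if suffixes[i] > suffixes[j]
    )
    return 1 if count % 2 == 0 else -1


def determinant(matrix):
    """Sum of the n! signed terms a_i b_j c_k ... m_l of the definition."""
    n = len(matrix)
    return sum(
        # column `col` plays the part of a letter, suffixes[col] that of its suffix
        sign(suffixes) * prod(matrix[suffixes[col]][col] for col in range(n))
        for suffixes in permutations(range(n))
    )


print(determinant([[1, 2], [3, 4]]))                    # 1*4 - 3*2
print(determinant([[2, 0, 0], [0, 3, 0], [0, 0, 4]]))   # the leading diagonal term alone
```

In the second array every term except the leading diagonal term contains a zero factor, so the value is that term with its positive sign.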

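Let $\sum^{a}$ denote the generation by interchanging letters, and $\sum^{i}$ that by interchanging suffixes. Then

$$\Delta = \sum^{a} \pm\, a_1 b_2 c_3 \ldots m_n = \sum^{i} \pm\, a_1 b_2 c_3 \ldots m_n.$$

Let us write the general term of $\sum^{i}$ as

$$\epsilon_{ijk\ldots l}\; a_i b_j c_k \ldots m_l,$$

where $\epsilon_{ijk\ldots l}$ is ±1 according to the class of ijk...l. This is sometimes denoted by

$$\operatorname{sgn}\begin{pmatrix} 1 & 2 & 3 & \ldots & n \\ i & j & k & \ldots & l \end{pmatrix}.$$
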

Now the interchange of ij shows at once that the sign belonging to the suffix order jik...l is opposite to that belonging to ijk...l. The same remark holds of the interchange of any pair in the lower row. Applying this to Δ, if any two letters a, c are interchanged in all terms of $\sum^{i}$ we obtain a series $\sum u$, where each term u is equal and opposite to a term in $\sum^{i}$. Also any two such terms u, u' of $\sum u$ must differ, otherwise they would be also equal before the interchange. Thus $\sum u$ exactly tallies with −Δ: whence

If two columns of Δ are interchanged, Δ changes sign.

For like reasons

If two rows of Δ are interchanged, Δ changes sign.

Let p, q be two letters or two suffixes. In the substitution notation these last results are written

$$\begin{pmatrix} p & q \\ q & p \end{pmatrix} \Delta = -\Delta.$$

Should the elements of the rows, or columns, in question, be identical, the left-hand side of this last equation leaves Δ unchanged, so that

Δ = −Δ, so that Δ = 0,

whence, Δ vanishes if two columns (or rows) are identical.

In particular, if each element $e_i$ of Δ is unity, the above holds, so Δ vanishes. But now, on referring to the definition, we find each term of Δ is ±1. Hence the number of $C_+$ and of $C_-$ terms is the same. But the total number is n!. Hence

Each class $C_+$ and $C_-$ has n!/2 terms.
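This equal division of the n! arrangements can be confirmed by enumeration for small n. The sketch below is illustrative only; `class_sizes` is a hypothetical name, and n is assumed to be at least 2, as the argument by identical columns requires.

```python
from itertools import permutations
from math import factorial


def class_sizes(n):
    """Count the arrangements of 0, 1, ..., n - 1 falling into C+ and into C-."""
    even = odd = 0
    for p in permutations(range(n)):
        count = sum(1 for i in range(n) for j in range(i + 1, n) if p[i] > p[j])
        if count % 2 == 0:
            even += 1
        else:
            odd += 1
    return even, odd


for n in range(2, 6):
    print(n, class_sizes(n), factorial(n) // 2)
```

For each n tried, both counts agree with n!/2.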


An example of all possible interchanges of columns of Δ was given in (10), p. 7. There will be n! ways of writing Δ in general by such interchanges of columns, together with n! ways by interchanging rows. So for n = 3, if $\Delta = |a_1b_2c_3|$,

$$|a_1b_2c_3| = \ldots = -|a_2b_1c_3| = \ldots = |b_2a_1c_3| = \ldots$$

In this way any of the elements may be brought to occupy the first place originally filled by $a_1$.


4. Arrangement of Terms in the Expansion of a Determinant. 
Co-factors. 

Let us write $\Delta = T_1 \pm T_2 \pm \ldots \pm T_r \pm \ldots$, where the chief term $T_1$ is $a_1b_2c_3 \ldots m_n$ and the number of terms is n!. This series is very unlike most of the familiar series of elementary analysis, for here there is real difficulty in deciding on a natural order of its terms. There is no such thing as a general nth term in Δ, as every term is in a sense a general term. But let us agree upon one particular order as follows. We fix the order 123...n of suffixes in each term, and then arrange the terms alphabetically.¹ This not only gives a unique order of succession, but automatically cuts the series of n! terms into n equal sections of (n − 1)! terms, exactly like the A, B, C ... sections of a dictionary. So we write

$$\Delta = a_1A_1 + b_1B_1 + \ldots + m_1M_1 \qquad (5)$$

for $a_1$ is a factor of all the first (n − 1)! terms, $b_1$ of the next, and so on. The capital letter factors are called co-factors of the respective small letters. But we may equally well take the suffix order fixed as ijk...l; so we likewise obtain, if i = 1, 2, ..., n,

$$\Delta = a_iA_i + b_iB_i + \ldots + m_iM_i \qquad (6)$$

which defines co-factors of elements of the ith row. Thus $A_i$ is co-factor of $a_i$, $B_i$ of $b_i$, and so on.

¹ If n > 26, define this as abc...za'b'c'...z'a''b''c''... &c.!





Correlatively we may fix the letter order abc...m of a term and write the suffix sets in ascending order, reading the suffix set as an ordinary number in a sufficiently high scale.* For n = 3 this gives

$$a_1b_2c_3,\quad a_1b_3c_2,\quad a_2b_1c_3,\quad a_2b_3c_1,\quad a_3b_1c_2,\quad a_3b_2c_1.$$

It leads in particular to the development

$$\Delta = a_1A_1 + a_2A_2 + a_3A_3 + \ldots + a_nA_n \qquad (7)$$

and in general to

$$\Delta = e_1E_1 + e_2E_2 + e_3E_3 + \ldots + e_nE_n \qquad (8)$$

where e denotes any of the n letters.

In the above we have expanded Δ by a row or by a column. In each case the co-factor, typified by $E_i$, is a determinant of order n − 1.

For it contains all the (n — 1) ! terms obtained by permuting 
either letters or suffixes not represented by e, or i, and the charac- 
teristic alternation of sign accompanies the derangements. Hence 
the definition of a determinant is satisfied. 

In particular

$$A_1 = \begin{vmatrix} b_2 & c_2 & \ldots & m_2 \\ b_3 & c_3 & \ldots & m_3 \\ \vdots & & & \vdots \\ b_n & c_n & \ldots & m_n \end{vmatrix} \qquad (9)$$

for $a_1A_1$ contains the term $a_1b_2c_3 \ldots m_n$ which has the sign of the first term (the chief or leading diagonal term) in this last determinant. This is most easily remembered as the result of suppressing the row and column of Δ intersecting at $a_1$.

Unfortunately this last device would cause confusion in finding co-factors of other elements, because the sign of the resulting expression may be wrong. It is therefore useful to have a special name for the determinant so formed by suppressing row i and column e. It is called the minor of $e_i$. Let us represent this graphically.

[Diagram: the array of Δ with the row and the column through the element e suppressed.]

* If n < 10 the scale of ten will do.






This process has not disturbed the orders of the rows and columns retained. Each such determinant of order n − 1 obtained by suppressing a row and a column of Δ is called a first minor. So Δ will have $n^2$ first minors. They are given by the elements of the matrix

$$\begin{bmatrix} +A_1, & -B_1, & +C_1, & -D_1 & \ldots \\ -A_2, & +B_2, & -C_2, & +D_2 & \ldots \\ \ldots & & & & \end{bmatrix} \qquad (10)$$

with alternate signs like the pattern of white and black squares on a chess board; white + and black −. For it is obvious from the mode of definition adopted for co-factor and for minor, that they only may differ in sign, whereas this sign is determined by counting the adjacent moves, say right and downwards from position $a_1$ to that of the element in question. Thus the minor of $c_2$ has a negative sign because it is three such moves from position $a_1$.

If n Greek letters α, β, γ, ..., μ with n suffixes are used to denote such minors we may write the determinant Δ in still more ways

$$\Delta = a_1\alpha_1 - b_1\beta_1 + c_1\gamma_1 - \ldots,$$

$$\Delta = a_1\alpha_1 - a_2\alpha_2 + a_3\alpha_3 - \ldots,$$

&c., exactly corresponding to the expansions by co-factors

$$\Delta = a_1A_1 + b_1B_1 + c_1C_1 + \ldots,$$

$$\Delta = a_1A_1 + a_2A_2 + a_3A_3 + \ldots.$$

That these are actually equivalent is best seen by noticing that the minor of $b_1$ is by definition $|a_2c_3 \ldots m_n|$, while the terms of Δ involving $b_1$ must be given by

$$b_1\,|a_2c_3 \ldots m_n|$$

with the negative sign, since the suffixes are in natural order while b and a are interchanged.

Thus co-factors and first minors are numerically equal but differ in sign according to the scheme (10).

An example of the use of minors with the characteristic alternate signs is given in (13), p. 7.
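The relation between minors and co-factors can be verified on a numerical array. The following sketch is an illustration only; the names `submatrix`, `det`, `minor` and `cofactor` are hypothetical, rows and columns are counted from 0, and the chess-board sign of scheme (10) is taken as (−1) raised to the sum of the row and column positions.

```python
def submatrix(m, i, j):
    """Suppress row i and column j, keeping the order of what is retained."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i]


def det(m):
    """Determinant by expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det(submatrix(m, 0, j)) for j in range(len(m)))


def minor(m, i, j):
    """First minor: the determinant left after suppressing row i and column j."""
    return det(submatrix(m, i, j))


def cofactor(m, i, j):
    """Minor with the sign of the chess-board scheme attached."""
    return (-1) ** (i + j) * minor(m, i, j)


m = [[2, 1, 3],
     [0, 4, 1],
     [5, 2, 6]]
for i in range(3):
    # expansion by the co-factors of row i; every row gives the same value
    print(sum(m[i][j] * cofactor(m, i, j) for j in range(3)))
```

Expansion by any column gives the same value again, whereas the elements of one row combined with the co-factors of another give zero.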





EXAMPLES 

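1. In

$$\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}$$

the co-factor of $a_1$ is $b_2c_3 - b_3c_2$ which is the same as its minor. But the co-factor of $b_3$ is $-(a_1c_2 - a_2c_1)$ since the determinant can be written

$$-\begin{vmatrix} b_3 & a_3 & c_3 \\ b_1 & a_1 & c_1 \\ b_2 & a_2 & c_2 \end{vmatrix}.$$

2. Expand (abc) by its second column; and also by its third row.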

3 . Prove “{■■■ &2‘^2 ^3-^3 - 0. 

4. Prove $(pa_1 + qa_2)A_3 + (pb_1 + qb_2)B_3 + (pc_1 + qc_2)C_3 = 0$.

5. What is the sign in the scheme (10) at the ith column and jth row?
Ans. $(-1)^{i+j}$.

5. Laplace’s Development of a Determinant. 

Just as the full expansion $\sum \pm\, a_1b_2 \ldots m_n$ of a determinant Δ arises from the n! permutations of n different suffixes 1, 2, ..., n, so also special forms of the expansion are found by considering the modified set of permutations of n things when r are alike of one kind, s of another, t of another, and so on. This number of permutations is known to be

$$\frac{n!}{r!\,s!\,t!\ldots},$$

where r + s + t + ... = n. In particular, for two kinds, r, n − r, it is

$$\binom{n}{r} = \frac{n!}{r!\,(n-r)!}.$$

To fix our ideas, consider the kinds as white and black: the first r things being white, the following n − r things black. Thus, as regards colour, there are only two different things, and only $\binom{n}{r}$ colour arrangements of the n original things; but using a stricter criterion, each colour arrangement subdivides into r!(n − r)! different arrangements, when each individual thing is regarded as different.

Further, we may imagine all the colour arrangements of the n things first made, and then subdivided, so that we can think of the original n! arrangements in their colour order; namely the first r!(n − r)! of these arrangements belong to one colour order, the next r!(n − r)! to another, and so on.

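For instance, taking three things, one white $w_1$ and two black $b_2$, $b_3$, there are in all six arrangements

$$w_1b_2b_3,\quad w_1b_3b_2,\quad b_2w_1b_3,\quad b_3w_1b_2,\quad b_2b_3w_1,\quad b_3b_2w_1$$

derived from three colour arrangements

wbb, bwb, bbw.

For five things $w_1, w_2, b_3, b_4, b_5$, two white and three black, there are ten colour arrangements, say

wwbbb, ..., bbwbw, bbbww,

and twelve (2! 3!) subdivisions of each. The twelve subdivisions of wwbbb are

$$\begin{matrix} w_1w_2b_3b_4b_5 & w_2w_1b_3b_4b_5 \\ w_1w_2b_3b_5b_4 & w_2w_1b_3b_5b_4 \\ w_1w_2b_4b_3b_5 & w_2w_1b_4b_3b_5 \\ w_1w_2b_4b_5b_3 & w_2w_1b_4b_5b_3 \\ w_1w_2b_5b_3b_4 & w_2w_1b_5b_3b_4 \\ w_1w_2b_5b_4b_3 & w_2w_1b_5b_4b_3 \end{matrix}$$
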

Manifestly the subdivision of a given colour arrangement can 
be made partially, black first and then white, as in the above 
scheme read by columns, or white first and then black, as in the 
above scheme read by rows. These partial subdivisions go on 
independently because they each only affect arrangements entirely 
within a colour group. 

Now let this arrangement be made of the original terms of the determinant, where the first r suffixes are called white and the next n − r black, the letters a, b, ..., m being fixed in order. We first have $\binom{n}{r}$ colour arrangements, which are next subdivided into an array of (n − r)! rows and r! columns, each column containing permutations of the black but not the white.

Since the letters a, b, ..., m are fixed in order, and the black suffixes alone are deranged in a column, each inversion being accompanied by a change of sign in the term, it follows that the sum of terms in a column is a determinant, $E_{n-r}$ say, of order (n − r) multiplied by the product of r elements whose suffixes are white. Also this determinant will appear in each of the r! columns of the same original colour arrangement. Summing such r! columns we obtain the sum of these coefficients, which for the same reason give a determinant $E_r$ of order r due to permutation of white suffixes. Thus each colour arrangement gives an array of terms whose sum $E_rE_{n-r}$ is a product of two determinants

$$E_r = |a'_1b'_2 \ldots f'_r|, \qquad E_{n-r} = |g'_{r+1}h'_{r+2} \ldots m'_n|, \qquad (13)$$


where a', b', ..., f', g', ..., m' are the letters a, b, ..., m in some order. For the letter order has been deranged by factorizing the terms of one colour grouping. Hence the original determinant is expressed as a series of terms

$$\sum E_rE_{n-r}$$

by rearranging the whole series of n! terms in the manner explained. But inasmuch as each term of $\sum E_rE_{n-r}$ now has its suffixes in the original order, the terms are derived from one another by applying what has been called the colour permutation to the letters ab...m instead of to their suffixes; for this is the effect on the letters of arranging a typical term of each colour grouping (12) in ascending order of its suffixes. And finally if we examine the chief term of $E_r$ and of $E_{n-r}$, which is

$$a'_1b'_2 \ldots f'_r \times g'_{r+1}h'_{r+2} \ldots m'_n,$$

we see that it is a term of the original determinant with letters, not suffixes, deranged. We infer that, if n > 2,

$$a'b' \ldots f'g'h' \ldots m'$$

belongs with ab...fgh...m to the $C_+$ class. This completely specifies the sign of the term and we finally write

$$|a_1b_2 \ldots m_n| = \sum |a'_1b'_2 \ldots f'_r|\;|g'_{r+1}h'_{r+2} \ldots m'_n|, \qquad (14)$$

or

$$\Delta = \sum E_rE_{n-r}.$$

This is called a Laplace¹ development of the determinant by its first r and next n − r rows.

Example.

$$|a_1b_2c_3d_4| = |a_1b_2|\,|c_3d_4| + |a_1c_2|\,|d_3b_4| + |a_1d_2|\,|b_3c_4| + |b_1c_2|\,|a_3d_4| + |b_1d_2|\,|c_3a_4| + |c_1d_2|\,|a_3b_4|.$$

Various corollaries immediately follow. For let ij...q be a $C_+$ arrangement of the suffixes 1, 2, ..., n. Then

$$|a_1b_2 \ldots m_n| = |a_ib_j \ldots m_q| = \sum |a'_ib'_j \ldots|\;|\ldots m'_q|$$

say. This gives a Laplace development by any assigned r rows and the complementary n − r rows.

Similarly we may fix the letter order in the $C_+$ class and permute the suffix order. This gives a Laplace (r, n − r) development by columns, also denoted by (r | n − r).

Once more, by using three or more colours in the original permutations we may make a Laplace (r, s, t, ...) development by rows or by columns,

$$\Delta = \sum E_rE_sE_t \ldots, \qquad (15)$$

a series of n!/r! s! t! ... terms.
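A Laplace development by the first r rows and the next n − r rows can be compared numerically with the full expansion. The sketch below is an illustration only; the names `det` and `laplace_first_rows` are hypothetical, 0 < r < n is assumed, and each product of complementary minors is given the sign of the class to which its arrangement of columns belongs.

```python
from itertools import combinations


def det(m):
    """Determinant by expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    return sum(
        (-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
        for j in range(len(m))
    )


def laplace_first_rows(m, r):
    """Sum of products of complementary minors from rows 0..r-1 and rows r..n-1."""
    n = len(m)
    total = 0
    for chosen in combinations(range(n), r):
        rest = [c for c in range(n) if c not in chosen]
        order = list(chosen) + rest
        # class of the arrangement (chosen columns, then the complementary ones)
        count = sum(1 for i in range(n) for j in range(i + 1, n) if order[i] > order[j])
        sign = 1 if count % 2 == 0 else -1
        upper = [[m[i][c] for c in chosen] for i in range(r)]
        lower = [[m[i][c] for c in rest] for i in range(r, n)]
        total += sign * det(upper) * det(lower)
    return total


m = [[1, 2, 0, 3],
     [4, 1, 2, 0],
     [0, 3, 1, 5],
     [2, 0, 4, 1]]
print(det(m), laplace_first_rows(m, 2))  # the two values agree
```

With r = 1 the development reduces to the expansion by the first row, each term being an element multiplied by its co-factor.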

EXAMPLES 

1. Expand $|a_1b_2c_3d_4|$ as $\sum^{a} |a_ib_j|\,|c_kd_l|$ where ij, kl = 13, 42: also where ij, kl = 41, 32.

2. Expand $|a_1b_2c_3d_4e_5|$ as $\sum^{a} |a_1b_2c_3|\,|d_4e_5|$; also as $\sum^{i} a_1\,|b_2c_3|\,|d_4e_5|$.

3. If $|a_1b_2c_3d_4e_5f_6|$ is expanded in various developments but without regard to the sign of the term, what sign should be attached to each of the following?

I ^2^4 I 1 !? I I I b^c^ei |, [ | | I • 

Ans. — , — , 4-. 

4. Show that the ordinary expansion $\sum \pm\, a_1b_2 \ldots m_n$ of $|a_1b_2 \ldots m_n|$ is a particular case of a Laplace development.

5. Show that the expansion by a row or a column, c.g. is a 

Laplace development. 

¹ Dating from 1772. Cf. Laplace, Œuvres, VIII, 366-406; Muir, History, I, p. 24.





6. Algebraic Complements and Minors of Order r. 

If $\Delta = \sum E_rE_{n-r}$ is a Laplace development where each term is positive, the factors $E_r$, $E_{n-r}$ are sometimes called algebraic complements of each other.

For instance in | | the algebraic complement of is 

In $|a_1b_2c_3d_4|$, that of $|a_1c_2|$ is $|d_3b_4|$.

Definition of Minor of Order r. — The determinant obtained by 
suppressing any n — r rows and any n — r columns of A is called 
a minor of order r. 

It is also called an {n — r)th minor, in agreement with the 
first minors already introduced, where n— r = 1. 

It is best to extend this definition so as to include, as such a minor, a determinant made by any derangement of rows and of columns. Hence we regard the two determinants given by

$$\pm\,|p_i q_j \ldots s_k|$$

as minors of order r, where p, q, ..., s are any r of the n letters a, b, ..., m and i, j, ..., k are any r of the suffixes 1, 2, ..., n.

In this way both $E_r$ and $E_{n-r}$ are minors, and when their product is a term in a Laplace development of Δ, they are called complementary minors. Their other name, algebraic complements, is used also in a different sense: namely, if

$$\Delta = \sum E_rE_{n-r} = \sum |a'_1b'_2 \ldots f'_r|\;|g'_{r+1}h'_{r+2} \ldots m'_n|,$$

the partial letter rows a'b'...f' and g'h'...m' in this order are called algebraic complements of each other.

The phrase is used to specify two complementary letter (or suffix) sets in such cases as the Laplace development. Clearly it is relative to a given natural order. Thus, relative to the order 1234, possible algebraic complements are

    1, 234      12, 34      34, 12
    2, 314      13, 42      42, 13
    3, 124      14, 23      23, 14
    4, 213

but not 234, 1.





7. Determinantal Permutation. 

Definition. The arrangements of n letters ab...f, gh...m where r precede and n − r succeed the comma according to the (r : n − r) Laplace development is called a determinantal permutation of the n letters. It is denoted by

$$\dot a \dot b \ldots \dot f,\ \dot g \dot h \ldots \dot m.$$

For example we should write

$$\dot a,\ \dot b \dot c \;=\; a, bc;\quad b, ca;\quad c, ab.$$

When this notation is used on arguments a, b, c of a function of a, b, c it is understood to mean the sum of the functions obtained by making these permutations.

The dot placed above a symbol (letter or suffix) indicates that it undergoes permutation. The notation evidently gives a compact way of writing a Laplace development. Thus

$$|a_1b_2c_3d_4e_5| = |\dot a_1 \dot b_2|\;|\dot c_3 \dot d_4 \dot e_5|,$$

where a series of 5!/2! 3! terms is indicated. Similarly for further subdivisions, including the original expansion. So

$$\begin{aligned} \Delta = |a_1b_2c_3d_4e_5| &= \dot a_1 \dot b_2 \dot c_3 \dot d_4 \dot e_5 \\ &= |\dot a_1 \dot b_2 \dot c_3|\;|\dot d_4 \dot e_5| \\ &= |\dot a_1 \dot b_2|\;|\dot c_3 \dot d_4|\;\dot e_5, \end{aligned}$$

&c.

We may therefore speak of an (r, s, t, ...) determinantal permutation of r + s + t + ... things. But if r = s = t = ... = 1 we must utilize a negative sign to specify certain of the terms. Thus

$$\dot a, \dot b, \dot c \;=\; a, b, c;\quad -a, c, b;\quad -b, a, c;\quad b, c, a;\quad c, a, b;\quad -c, b, a.$$

Also, as an alternative to the $\dot a,\ \dot b \dot c$ above, we have

$$\dot a,\ \dot b \dot c \;=\; a, bc;\quad -b, ac;\quad c, ab,$$

and so on.





As further examples of the notation we may have 

I I i ^1^2 1 =“ I ^1^2 1 I I I ^1^2 1 1 |> 

$$\sin(A - B) = \sin \dot A \cos \dot B.$$

EXAMPLES 

1. Prove

$$(d-c)(d-b)(d-a)(c-b)(c-a)(b-a) = \begin{vmatrix} 1 & 1 & 1 & 1 \\ a & b & c & d \\ a^2 & b^2 & c^2 & d^2 \\ a^3 & b^3 & c^3 & d^3 \end{vmatrix}.$$

This determinant is called an alternant.

2. Give the corresponding identity for an alternant of orders 2, 3, and n.

3. Prove the rule of signs for the determinant by considering the alternant.

4. Resolve into factors

$$\begin{vmatrix} 1 & 0 & 1 & 1 \\ a & 1 & c & d \\ a^2 & 2a & c^2 & d^2 \\ a^3 & 3a^2 & c^3 & d^3 \end{vmatrix}.$$

a3 3a2 c3 d® 

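5. Prove by resolving into partial fractions that if

$$f(x) = p_0x^r + p_1x^{r-1} + \ldots + p_r, \qquad r < n,$$

then

$$\frac{f(x)}{(x-\lambda_1)(x-\lambda_2)\ldots(x-\lambda_n)} = \begin{vmatrix} 1 & 1 & \ldots & 1 \\ \lambda_1 & \lambda_2 & \ldots & \lambda_n \\ \vdots & & & \vdots \\ \lambda_1^{n-2} & \lambda_2^{n-2} & \ldots & \lambda_n^{n-2} \\ \dfrac{f(\lambda_1)}{x-\lambda_1} & \dfrac{f(\lambda_2)}{x-\lambda_2} & \ldots & \dfrac{f(\lambda_n)}{x-\lambda_n} \end{vmatrix} \div \begin{vmatrix} 1 & 1 & \ldots & 1 \\ \lambda_1 & \lambda_2 & \ldots & \lambda_n \\ \vdots & & & \vdots \\ \lambda_1^{n-2} & \lambda_2^{n-2} & \ldots & \lambda_n^{n-2} \\ \lambda_1^{n-1} & \lambda_2^{n-1} & \ldots & \lambda_n^{n-1} \end{vmatrix}.$$

6. If the denominator $(x-\lambda_1)\ldots(x-\lambda_n)$ of the preceding example is written g(x), express the integral of a rational function

$$\int \frac{f(x)}{g(x)}\,dx$$

as the quotient of two determinants of order n.

[Replace each $f(\lambda_i)/(x-\lambda_i)$ of row n above by $f(\lambda_i)\log(x-\lambda_i)$.]

7. Evaluate f(x)/g(x) and $\int \{f(x)/g(x)\}\,dx$ as quotients of determinants in the case when g(x) has repeated factors.

[Replace $\mathrm{col}_2$ of both determinants by columns obtained from $\mathrm{col}_1$ by differentiating with regard to $\lambda_1$ as in Ex. 4. This covers the case when $\lambda_2 = \lambda_1$ alone. Further, such differentiation solves the problem for higher repetition.]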


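8. If in the alternant Δ of Ex. 1, a and b are conjugate complex numbers, r(cos α ± i sin α), prove

$$\Delta = 2i \begin{vmatrix} 0 & 1 & 1 & 1 \\ r\sin\alpha & r\cos\alpha & c & d \\ r^2\sin 2\alpha & r^2\cos 2\alpha & c^2 & d^2 \\ r^3\sin 3\alpha & r^3\cos 3\alpha & c^3 & d^3 \end{vmatrix}.$$

[Use $\mathrm{col}_1 - \mathrm{col}_2$, $\mathrm{col}_1 + \mathrm{col}_2$.]

9. Adapt Ex. 5 to the case when $\lambda_1$, $\lambda_2$ are conjugate complex numbers.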


10. The first n integers are deranged and written also in their original order, so:

    4 3 2 5 1

    1 2 3 4 5

If the n pairs 11, 22, ..., nn are joined by lines, curved if necessary to avoid multiple intersections, prove that the number of intersections determines the class $C_+$ or $C_-$ of the upper derangement. [Aitken.]

11. Rothe's theorem on conjugate permutations. Two derangements are conjugate if the element and place occupied in one become the place occupied and element in the other. Show that the conjugate of 43251 is 53214.

Further, show that the scheme

    1 2 3 4 5

    5 3 2 1 4

has the same pattern of intersection lines as in Ex. 10.

Hence prove that two conjugate permutations belong to the same class. [Aitken.]

12. A derangement is self conjugate if its conjugate is itself. Prove that its pattern is symmetrical about its horizontal bisector.

13. By considering such symmetrical patterns of n, n − 1 and n − 2 columns, prove that the relation

$$U_n = U_{n-1} + (n-1)\,U_{n-2}$$

connects the numbers $U_n$, $U_{n-1}$, $U_{n-2}$ of self conjugate patterns of n, n − 1 and n − 2 things respectively. [Rothe, Aitken.]

Cf. Muir, History of Determinants, I (1906), 60.



CHAPTER III 


Linear Properties. Fundamental Laplace Identities 

1. Linearity. Homogeneity. 

The w-rowed determinant A = | I ^ linear 

function of the elements of any row or column. So many pro- 
perties of A hang on this that it is worth explaining in some 
detail. 

To begin with, the notation $f(x)$ is used to denote a function
of a single variable or argument $x$; while $f(x_1, x_2, \dots, x_n)$
denotes a function of $n$ different arguments. If these arguments
can usefully be called a set, or one-rowed matrix, as when they
serve as co-ordinates of a point, we frequently contract this
notation and write

$$f(x_i) = f(x_1, x_2, x_3, \dots, x_n). \qquad (1)$$

We may even drop the suffix and write simply

$$f(x).$$

This is the contracted functional notation for a function of a specified
set of arguments

$$x = [x_1, x_2, \dots, x_n]. \qquad (2)$$

The function is homogeneous and of order $p$ in its arguments
if, and only if,

$$k^p f(x_1, x_2, \dots, x_n) = f(kx_1, kx_2, \dots, kx_n) \qquad (3)$$

identically for all values of $k$. Thus if $a_1, a_2, \dots, a_n$ are
independent of $x$, the function

$$a_1x_1 + a_2x_2 + \dots + a_nx_n \qquad (4)$$

is of order unity. It is a linear homogeneous form in $n$ arguments
$x$. We write this in various ways, for example,

$$\sum_{i=1}^{n} a_ix_i, \quad a_x, \quad (a\,|\,x), \quad (ax). \qquad (5)$$





Now the fundamental property of a linear form is the simplicity
of its addition theorem, namely

$$f(x + y) = f(x) + f(y),$$

which can be interpreted in the contracted functional notation.
Thus if

$$f(x) = a_1x_1 + a_2x_2 + \dots + a_nx_n,$$

$$f(y) = a_1y_1 + a_2y_2 + \dots + a_ny_n,$$

then

$$f(x + y) = a_1(x_1 + y_1) + a_2(x_2 + y_2) + \dots + a_n(x_n + y_n).$$

More generally if we multiply throughout by $p$ and $q$ respectively
and add,

$$f(px + qy) = p\,f(x) + q\,f(y)$$

where

$$px + qy = [px_1 + qy_1,\ px_2 + qy_2,\ \dots,\ px_n + qy_n].$$

An immediate consequence is the following theorem. 

The $n$-rowed determinant $\Delta$ is unaltered in value by adding
to one of its columns any linear combination of its other columns.
This is true also of its rows.

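Thus, by $\mathrm{col}_1 + p\,\mathrm{col}_2 + q\,\mathrm{col}_3 + r\,\mathrm{col}_4$,

$$\begin{vmatrix} a_1 + pb_1 + qc_1 + rd_1 & b_1 & c_1 & d_1 \\ a_2 + pb_2 + qc_2 + rd_2 & b_2 & c_2 & d_2 \\ a_3 + pb_3 + qc_3 + rd_3 & b_3 & c_3 & d_3 \\ a_4 + pb_4 + qc_4 + rd_4 & b_4 & c_4 & d_4 \end{vmatrix} = (abcd) + p(bbcd) + q(cbcd) + r(dbcd) = (abcd),$$

because the other terms, each having an identical pair of columns,
vanish.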
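The theorem is easily checked on a numerical case. The following sketch uses exact integer arithmetic; the helper names `det` and `add_combination` and the sample matrix are our own choices, not part of the text.

```python
def det(m):
    """Determinant by expansion along the first row (exact for integers)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j in range(len(m)):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(minor)
    return total


def add_combination(m, target, coeffs):
    """Copy of m with sum(coeffs[j] * column j) added to column `target`."""
    out = [row[:] for row in m]
    for row in out:
        row[target] += sum(k * row[j] for j, k in coeffs.items())
    return out


m = [[2, 1, 0, 3],
     [1, 4, 1, 0],
     [0, 2, 5, 1],
     [3, 0, 1, 2]]

# col_1 + 7 col_2 - 2 col_3 + 5 col_4 (columns are 0-indexed in the code)
changed = add_combination(m, 0, {1: 7, 2: -2, 3: 5})
assert det(changed) == det(m)
```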

With the notation of (5) the following are very useful examples 
of this linearity: 

$$a_{px+qy} = p\,a_x + q\,a_y,$$

p and q being common factors of all n terms. 





2. Special Determinants. 

The following cases, the results of which can be easily verified, 
are worth noticing. 

(1) The unit determinant. It has unity for each leading diagonal 
element, and zero for each other element. 


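$$\begin{vmatrix} 1 & \cdot \\ \cdot & 1 \end{vmatrix}, \qquad \begin{vmatrix} 1 & \cdot & \cdot \\ \cdot & 1 & \cdot \\ \cdot & \cdot & 1 \end{vmatrix}, \qquad \begin{vmatrix} 1 & \cdot & \cdot & \cdot \\ \cdot & 1 & \cdot & \cdot \\ \cdot & \cdot & 1 & \cdot \\ \cdot & \cdot & \cdot & 1 \end{vmatrix}.$$

The matrix of this determinant is called the unit matrix (§8, p. 68).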

(2) The value of the unit determinant is unaltered by filling 
up one triangle of zeros with arbitrary elements. 


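$$\begin{vmatrix} 1 & \cdot \\ x & 1 \end{vmatrix} = \begin{vmatrix} 1 & \cdot & \cdot \\ x & 1 & \cdot \\ y & z & 1 \end{vmatrix} = 1.$$

(3)

$$\begin{vmatrix} a_1 & \cdot & \cdot & \cdot \\ x & a_2 & \cdot & \cdot \\ y & z & a_3 & \cdot \\ p & q & r & a_4 \end{vmatrix} = a_1a_2a_3a_4.$$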




(4) A determinant of lower order can be expressed as one of 
higher order without disarranging its elements. 


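$$\begin{vmatrix} a & b \\ c & d \end{vmatrix} = \begin{vmatrix} a & b & x \\ c & d & y \\ \cdot & \cdot & 1 \end{vmatrix} = \begin{vmatrix} a & b & x & z \\ c & d & y & t \\ \cdot & \cdot & 1 & \cdot \\ \cdot & \cdot & \cdot & 1 \end{vmatrix}.$$

Thus the determinant $\begin{vmatrix} a & b \\ c & d \end{vmatrix}$ is extended diagonally with the
unit matrix, bordered on one side with zeros and on the other
with arbitrary elements.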
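These special cases can be verified on small numerical instances. The sketch below is illustrative only; the entries are arbitrary integers of our choosing and `det` is a plain first-row expansion.

```python
def det(m):
    """Determinant by expansion along the first row (exact for integers)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j in range(len(m)):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(minor)
    return total


# (2) unit determinant with one triangle of zeros filled arbitrarily
unit_with_triangle = [[1, 0, 0], [7, 1, 0], [-3, 5, 1]]
# (3) triangular determinant: product of the leading diagonal
triangular = [[2, 0, 0, 0], [9, 3, 0, 0], [4, -1, 5, 0], [6, 8, 7, -2]]
# (4) a two-rowed determinant extended to four rows
small = [[3, 4], [5, 6]]
bordered = [[3, 4, 11, 13], [5, 6, 17, 19], [0, 0, 1, 0], [0, 0, 0, 1]]

print(det(unit_with_triangle))    # 1
print(det(triangular))            # -60
print(det(small), det(bordered))  # -2 -2
```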


3. Double Suffix Notation and other Contractions. 

Hitherto we have used letters to distinguish columns, and 
suffixes for rows. This has certain advantages, but not such 




as entirely supersede other notations. Let us now write 


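$$a_1,\ a_2,\ a_3,\ \dots \quad \text{for} \quad a,\ b,\ c,\ \dots,\ m,$$

and $a_{ij}$ for the letter of the $i$th row and the $j$th column: so that
a typical determinant is

$$\begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{vmatrix}. \qquad (6)$$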


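We adopt this simple notation¹ $|a_{ij}|$ for the determinant $\Delta$,
and $[a_{ij}]$ for its matrix, so that

$$[a_{ij}] = \begin{bmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{bmatrix}. \qquad (7)$$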


In consonance with a previous use, $\Delta$ is sometimes denoted by
$(a_1a_2a_3)$, where $a_i$ stands for the $i$th column.

As a rule there is no ambiguity in practice when the order
$n$ of the determinant is unspecified, so that $|a_{ij}|$ is of whatever
order immediately concerns us. Where doubt may exist the
order must be clearly explained.

A particular case of this notation is defined by 


$$|\delta_{ij}|, \quad [\delta_{ij}], \quad \delta_{ij} = \begin{cases} 1, & i = j, \\ 0, & i \neq j. \end{cases} \qquad (8)$$

This symbol, which is called the Kronecker delta, characterizes
the unit determinant and the unit matrix.


4. A Determinant is irresoluble into Factors.

Regarded as a rational integral function of its $n^2$ elements,
a determinant has no rational factors. For suppose if possible
that $\Delta = |a_{ij}|$ can be written as the product of two rational
factors $\theta\phi$.

Since $\Delta$ is linear in each element, $a_{11}$ cannot occur in both
factors $\theta$, $\phi$. Suppose that it occurs in $\theta$.

In the expansion of the determinant no term occurs in which
$a_{11}$ is multiplied by any element belonging to its row or column.
Thus $\phi$ can involve no element belonging to the first row or the
first column. Let $a_{rs}$ be an element which does occur in $\phi$. By
similar reasoning no element belonging to the $r$th row or $s$th
column can occur in $\theta$.

Thus the two elements $a_{r1}$, $a_{1s}$ cannot occur either in $\theta$ or in $\phi$.
But the expansion of the determinant involves every element.
Our supposition that $\Delta$ can be written as a product of factors
$\theta\phi$ is therefore untenable.

¹ Introduced by H. J. S. Smith (1862) and established by Kronecker.


5. Rules for Combining Matrices. 

It is now the place to give the rules for addition and sub- 
traction of matrices. These rules, which are due to Cayley, turn 
out to justify themselves, although they contradict some of the 
corresponding rules for determinants. 

If $a_{ij}$ and $b_{ij}$ are corresponding elements in row $i$ and column
$j$ of two matrices $A$ and $B$, the sum of $A$ and $B$ is a matrix with
$a_{ij} + b_{ij}$ for corresponding element. This is the definition on the
understanding that it is true for all values of $i$ and $j$, so that
$A$ and $B$ must be conformable; they must each have the same
number $n$ of columns and $m$ of rows.

A rule is sometimes given for addition, when the matrices 
are unconformable; but this case will not be considered. 

Let the sign $\{ij\}$ placed after an equality mean "identically
for all values of $i$ and $j$"; then the $m$ by $n$ matrix $C$ is the sum of
$A$ and $B$ if

$$c_{ij} = a_{ij} + b_{ij} \quad \{ij\} \qquad (9)$$

where $A = [a_{ij}]$, $B = [b_{ij}]$, $C = [c_{ij}]$. We now write this
comprehensively as

$$C = A + B. \qquad (10)$$

Likewise for subtraction, we define

$$C = A - B \qquad (11)$$

to mean

$$c_{ij} = a_{ij} - b_{ij} \quad \{ij\}. \qquad (12)$$

In particular we write $A + A = 2A$, so that $2A$ denotes a
matrix wherein each element of $A$ is doubled: whence if $r$ is a
positive integer

$$rA = [ra_{ij}], \qquad (13)$$

where every element is multiplied by $r$.




Again, if $A = B$ then $a_{ij} = b_{ij}$ $\{ij\}$, while if $A + B$ is the
null matrix, $a_{ij} + b_{ij} = 0$. Accordingly we write

$$[a_{ij}] - [a_{ij}] = [a_{ij} - a_{ij}] = [0] = 0:$$

also

$$[a_{ij}] + [-a_{ij}] = [a_{ij} - a_{ij}] = 0.$$

This in fact defines $-A$, namely

$$A = [a_{ij}], \quad -A = [-a_{ij}]. \qquad (14)$$

The reader who has examined the theory of indices in 
elementary algebra will have no difficulty in extending the 
validity of relation (13) to cover cases where r is not merely a 
positive integer, but is negative (as in (14) ), zero, rational, 
real or complex. Let us call such values scalar numbers to
distinguish them from the entities A, B, C which are arrays of 
numbers, although they behave in many ways like scalar or 
ordinary numbers. This behaviour is summed up by saying: 

Linear combinations of matrices with scalar coefficients obey 
the rules of ordinary algebra. 

In fact we may prove without difficulty the following fundamental
identities:

$$\begin{aligned} A = B \ &\text{implies}\ B = A, \\ A + B &= B + A, \\ (A + B) + C &= A + (B + C), \\ rA + rB &= r(A + B), \\ rA + sA &= (r + s)A, \\ rA &= Ar. \end{aligned} \qquad (15)$$

Each of these equations involving $A$, $B$, $C$ is merely an abbreviation
for a set of $mn$ equations involving $a_{ij}$, $b_{ij}$, $c_{ij}$, where both
$i$ and $j$ remain constant in each particular equation.
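Since each identity in (15) is a set of element-by-element equations, it can be tested directly on small integer matrices. The helpers `add` and `scale` below are our own minimal sketch of the rules (9) and (13), not a library interface.

```python
def add(a, b):
    """Sum of two conformable matrices, element by element."""
    if len(a) != len(b) or any(len(r) != len(s) for r, s in zip(a, b)):
        raise ValueError("matrices are not conformable")
    return [[x + y for x, y in zip(r, s)] for r, s in zip(a, b)]


def scale(r, a):
    """Scalar multiple rA: every element multiplied by r."""
    return [[r * x for x in row] for row in a]


A = [[1, 2, 3], [4, 5, 6]]
B = [[0, -1, 2], [7, 3, -4]]
C = [[5, 5, 5], [1, 0, 1]]
r, s = 3, -2

assert add(A, B) == add(B, A)
assert add(add(A, B), C) == add(A, add(B, C))
assert add(scale(r, A), scale(r, B)) == scale(r, add(A, B))
assert add(scale(r, A), scale(s, A)) == scale(r + s, A)
assert add(A, scale(-1, A)) == [[0, 0, 0], [0, 0, 0]]  # A + (-A) is the null matrix
```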

We may even have relations linear in the matrices but not
linear in scalar numbers. It would be true to say

$$\frac{xA}{y - z} + \frac{yB}{z - x} = \frac{x(z - x)A + y(y - z)B}{(y - z)(z - x)}$$

where $x$, $y$, $z$ are scalar numbers. What is at present excluded
is a product of matrices $AB$, $AC$, $A^2$, ..., which will later be
defined.





The reader who is familiar with the use of vectors in one form 
or another will recognise that these laws are identical with the 
addition laws of vectors. 

Transposition of a Matrix. 

Definitions. The matrices $A = [a_{ij}]$ and $A' = [a_{ji}]$ are called
the transposed of one another. Each is obtained from the other
by interchanging its rows and columns. It is often convenient
to denote the transposed of $A$ by an accent. (Cf. Chap. I, §3.)

The property is conjugate or symmetrical, and sometimes $A'$
is called the conjugate of $A$.

When transposition leaves a matrix unaltered, the matrix is 
said to be symmetrical: if transposition is equivalent to changing 
the sign of all the elements the matrix is skew symmetrical. 

When a matrix has a single row, or a single column, it is 
called a vector. Thus there are two distinct types of vector, 
the row vector, and the column vector. 


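EXAMPLES

1. If

$$A = \begin{bmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \\ 7 & 8 & 9 \end{bmatrix} \quad \text{and} \quad B = \begin{bmatrix} 0 & 2 & 3 \\ 4 & 4 & 6 \\ 7 & 8 & 8 \end{bmatrix},$$

then

$$A + B = \begin{bmatrix} 1 & 4 & 6 \\ 8 & 9 & 12 \\ 14 & 16 & 17 \end{bmatrix}, \quad \text{while} \quad A - B = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}.$$

Again

$$pA + qB = \begin{bmatrix} p & 2p + 2q & 3p + 3q \\ 4p + 4q & 5p + 4q & 6p + 6q \\ 7p + 7q & 8p + 8q & 9p + 8q \end{bmatrix}.$$

If $A'$ is the transposed of $A$, then

$$A' = \begin{bmatrix} 1 & 4 & 7 \\ 2 & 5 & 8 \\ 3 & 6 & 9 \end{bmatrix}.$$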


2. $Q = \begin{bmatrix} a & h & g \\ h & b & f \\ g & f & c \end{bmatrix}$ is symmetrical, while $S = \begin{bmatrix} 0 & r & q \\ -r & 0 & p \\ -q & -p & 0 \end{bmatrix}$ is skew symmetric.

A skew symmetric matrix necessarily has zero elements throughout
the leading diagonal.

3. Prove that in general the sum of a square matrix and its transposed
matrix is symmetrical, while the difference is skew symmetrical.

4. Prove the determinant of the matrix equivalent to $pA$ is $p^n$ times
the determinant of $A$, if $A$ is a square matrix of order $n$.
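Examples 3 and 4 lend themselves to a numerical spot check before a proof is attempted. The sketch below uses a sample matrix of our own choosing and small helper functions that are not part of the text.

```python
def det(m):
    """Determinant by expansion along the first row (exact for integers)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j in range(len(m)):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(minor)
    return total


def transpose(a):
    """Interchange rows and columns."""
    return [list(col) for col in zip(*a)]


def combine(a, b, sign):
    """Element-wise a + sign * b for conformable matrices."""
    return [[x + sign * y for x, y in zip(r, s)] for r, s in zip(a, b)]


A = [[1, 2, 3], [4, 5, 6], [7, 8, 10]]
S = combine(A, transpose(A), +1)   # A + A'
K = combine(A, transpose(A), -1)   # A - A'

assert transpose(S) == S                                  # symmetrical
assert transpose(K) == [[-x for x in row] for row in K]   # skew symmetrical

p, n = 3, 3
pA = [[p * x for x in row] for row in A]
assert det(pA) == p ** n * det(A)
```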



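5. If $I$ denotes the unit matrix $\begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}$, then $A - \lambda I$ is

$$\begin{bmatrix} a_{11} - \lambda & a_{12} & a_{13} \\ a_{21} & a_{22} - \lambda & a_{23} \\ a_{31} & a_{32} & a_{33} - \lambda \end{bmatrix}.$$

The determinant $|A - \lambda I|$ is a polynomial of degree $n$ in $\lambda$. Likewise for
$|A + \lambda B|$ if $B$ is square and of order $n$.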

6. Prove $(A + B)' = A' + B'$.

7. Prove $A = (A')'$.

8. Any square matrix of order $n$ has at most $n^2$ arbitrary elements.
The symmetrical has $\tfrac12 n(n+1)$ and the skew symmetrical $\tfrac12 n(n-1)$
arbitrary elements.


6. Currency of a Matrix. 

It is often very well worth while to group several columns
as well as rows of a determinant in one symbol. Let us agree to
use capital letters with suffixes for this purpose, unless something
is said to the contrary.

We first write the $n$-rowed determinant

$$(a_1a_2 \dots a_r\, b_1b_2 \dots b_{n-r}) \qquad (16)$$

as

$$(A_rB_{n-r}). \qquad (17)$$

In full this is given by

$$\Delta = \begin{vmatrix} a_{11} & a_{12} & \dots & a_{1r} & b_{11} & b_{12} & \dots & b_{1s} \\ a_{21} & a_{22} & \dots & a_{2r} & b_{21} & b_{22} & \dots & b_{2s} \\ \vdots & & & & & & & \vdots \\ a_{n1} & a_{n2} & \dots & a_{nr} & b_{n1} & b_{n2} & \dots & b_{ns} \end{vmatrix} \qquad (18)$$

where $s = n - r$. Clearly this is a formidable expression, only
to be used sparingly, while (16) is much easier to handle and
(17) is even better still. In (16) each $a_i$ or $b_i$ denotes a column;
in (17) $A_r$, $B_s$ denote complementary oblong matrices making
together the square which furnishes determinant (18).

Definition of Currency. The suffix $r$ of $A_r$ is the currency of
the matrix $[A_r]$ for the field or category of order $n$.

So the currency specifies the number of columns in $A_r$.
In the same way we consider each $a_i$, $b_i$ of (16) to have unit
currency. They act as "small change" equivalent to two
"pieces" $A_r$, $B_s$ of higher currency in the determinant whose
contents is of total currency $n$.

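Next let $\Delta$ be cut just below the $r$th row and expanded by
Laplace's development. The result is a sum of $\binom{n}{r}$ terms

$$\pm\,(a_1a_2 \dots a_r)_{12 \dots r}\,(b_1b_2 \dots b_s)_{r+1, \dots, n}$$

where $a_1, a_2, \dots, b_s$ are deranged, although the outside suffixes
are fixed because they refer to rows. It is essential to have a
ready way of alluding to this fundamental operation, so we simply
denote the sum of all the $\binom{n}{r}$ arrangements by these equivalent
notations: either $(\dot A_r)_{12 \dots r}(\dot B_s)_{r+1, \dots, n}$ or
$(\dot a_1\dot a_2 \dots \dot a_r)_{12 \dots r}(\dot b_1\dot b_2 \dots \dot b_s)_{r+1, \dots, n}$. This is a
case of what has been called (p. 27) a determinantal permutation
of the $r$ columns of the matrix $A_r$ with the $s$ columns of $B_s$.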


7. Transposition Properties of Determinants. 

The process just described can be carried further by partitioning
one or both of $A_r$ and $B_s$. Equally well we may partition
a determinant into layers of rows, or even make a double partition
by columns and rows. In this case we express a determinant by
the matrices in the rectangular partitions. For example

$$\begin{vmatrix} a & b & c & d \\ e & f & g & h \\ p & q & r & s \\ x & y & z & t \end{vmatrix}$$

can be looked on as pieced together from four matrices in rectangular
array

$$\begin{bmatrix} a & b \\ e & f \end{bmatrix} \quad \begin{bmatrix} c & d \\ g & h \end{bmatrix}$$

$$\begin{bmatrix} p & q \\ x & y \end{bmatrix} \quad \begin{bmatrix} r & s \\ z & t \end{bmatrix}$$

forming a matrix of matrices. In all such partitioning the
relative positions of the original elements of the determinant
are maintained. Accordingly by

$$\Delta = \begin{vmatrix} A & B \\ D & E \end{vmatrix} \qquad (19)$$

is meant a determinant whose elements are partitioned into
four matrices $A$, $B$, $D$, $E$. If $A$, $D$ each have $r$ columns, $B$, $E$,
$s$ columns, $A$, $B$, $t$ rows, $D$, $E$, $u$ rows, then

$$r + s = t + u = n,$$

$\Delta$ being of order $n$. Now we can alter the expression for $\Delta$
in various useful ways. For by transposing all $s$ columns of
$B$, $E$ to precede columns of $A$, $D$ we have

$$\Delta = \begin{vmatrix} A & B \\ D & E \end{vmatrix} = (-)^{rs} \begin{vmatrix} B & A \\ E & D \end{vmatrix}. \qquad (20)$$

By transposing rows, we have also

$$\Delta = (-)^{tu} \begin{vmatrix} D & E \\ A & B \end{vmatrix} = (-)^{rs+tu} \begin{vmatrix} E & D \\ B & A \end{vmatrix}.$$

Similarly for triple and higher partitions. 

The main use of this matrix notation occurs when all the
matrices have $n$ rows, and determinants of orders $2n$, $3n$, ..., $pn$
are considered.

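For example, let $R$, $S$, $T$ be matrices of $n$ rows and $r$, $s$, $t$
columns respectively, where

$$r + s + t = 2n. \qquad (21)$$

Then

$$\Delta = \begin{vmatrix} R & S & \cdot \\ R & \cdot & T \end{vmatrix}$$

represents a determinant of $2n$ rows and $2n$ columns, the dots
signifying arrays of zeros. Now $\Delta$ is unaltered by

$$\mathrm{row}_{n+1} - \mathrm{row}_1,\ \dots,\ \mathrm{row}_{n+i} - \mathrm{row}_i,\ \dots,\ \mathrm{row}_{2n} - \mathrm{row}_n,$$

or briefly

lower matrix - upper matrix.

But by definition of subtraction of matrices

$$R - R = 0, \quad 0 - S = -S, \quad T - 0 = T.$$

Thus

$$\Delta = \begin{vmatrix} R & S & \cdot \\ \cdot & -S & T \end{vmatrix}. \qquad (22)$$

Similarly

$$\Delta = \begin{vmatrix} \cdot & S & -T \\ R & \cdot & T \end{vmatrix}.$$

Further, on multiplying by $-1$ each of the last $n$ rows of the
full expression summarized in (22), we change the sign of all
the elements concerned, and so $-S$ becomes $+S$ and $T$ becomes
$-T$. Hence

$$\Delta = (-)^n \begin{vmatrix} R & S & \cdot \\ \cdot & S & -T \end{vmatrix}.$$

But $T$ has $t$ columns. So, on multiplying the last $t$ columns of
$\Delta$ by $-1$, we obtain

$$\Delta = (-)^{n+t} \begin{vmatrix} R & S & \cdot \\ \cdot & S & T \end{vmatrix} = (-)^{n+t+rs} \begin{vmatrix} S & R & \cdot \\ S & \cdot & T \end{vmatrix}$$

as in (20).

So the determinants

$$\begin{vmatrix} R & S & \cdot \\ R & \cdot & T \end{vmatrix}, \qquad \begin{vmatrix} S & R & \cdot \\ S & \cdot & T \end{vmatrix}$$

can only differ in sign, and that only when $n + t + rs$ is odd.
Exactly similar reasoning shows that if

$$R,\ L,\ M,\ N,\ \dots$$

are $p$ matrices of currency $h$, $i$, $j$, $k$, ... respectively, the
determinant of order $(p-1)n$

$$\begin{vmatrix} R & L & \cdot & \cdot & \dots \\ R & \cdot & M & \cdot & \dots \\ R & \cdot & \cdot & N & \dots \\ \vdots & & & & \ddots \end{vmatrix},$$

apart from sign, is unaltered by writing any matrix $L$ or $M$ or
$N$ ... repeated in the first column and the rest diagonally as
before. The only condition is

$$h + i + j + k + \dots = (p - 1)n \qquad (25)$$

to make the total number of elements (not matrices) the same in
each row and column.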

The next section will illustrate this type of determinant. 



EXAMPLES 

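1. If $A$, $B$, $C$, ... are each two by two (or $2k$ by $2k$) matrices prove

$$\begin{vmatrix} A & B \\ C & D \end{vmatrix} = \begin{vmatrix} D & C \\ B & A \end{vmatrix} = \text{\&c.}$$

2. If $A$, $B$, $C$, ... are each square matrices, then

$$\begin{vmatrix} A & \cdot & \cdot \\ \cdot & B & \cdot \\ \cdot & \cdot & C \end{vmatrix} = |A|\,|B|\,|C|.$$

3. Prove

$$\begin{vmatrix} A & \cdot & \cdot \\ L & B & \cdot \\ M & N & C \end{vmatrix} = |A|\,|B|\,|C|.$$

[Expand by Laplace's method.]

4. If

$$\begin{vmatrix} a_1 & b_1 & c_1 & l_1 & m_1 & x_1 \\ a_2 & b_2 & c_2 & l_2 & m_2 & x_2 \\ \vdots & & & & & \vdots \\ a_6 & b_6 & c_6 & l_6 & m_6 & x_6 \end{vmatrix} = (A_3L_2X_1) = (ALX),$$

show that $(ALX) = (AXL) = -(XAL) = -(XLA) = (LAX) = -(LXA)$.

5. Examine the corresponding six permutations for $(A_iL_jX_k)$,
$i + j + k = n$.

6. Extend the linearity theorem at the middle of p. 31, to the case when
the letters $a$, $b$, $c$, $d$ are replaced by matrices.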

8. Fundamental Laplace Identities. 

Laplace’s development leads to many important results in 
particular cases, some of which will now be given. They mostly 
depend on cutting the determinant half-way across and expand- 
ing by complementary minors of equal order. 

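First expand the vanishing determinant

$$\begin{vmatrix} x_1 & y_1 & x_1 & y_1 \\ x_2 & y_2 & x_2 & y_2 \\ x_3 & y_3 & x_3 & y_3 \\ x_4 & y_4 & x_4 & y_4 \end{vmatrix};$$

then

$$(xy)_{12}(xy)_{34} + (xy)_{13}(xy)_{42} + (xy)_{14}(xy)_{23} = 0 \qquad (26)$$

identically.

Correlatively since

$$\begin{vmatrix} a_1 & b_1 & c_1 & d_1 \\ a_2 & b_2 & c_2 & d_2 \\ a_1 & b_1 & c_1 & d_1 \\ a_2 & b_2 & c_2 & d_2 \end{vmatrix} = 0,$$

then

$$(bc)(ad) + (ca)(bd) + (ab)(cd) = 0. \qquad (27)$$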
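Both three-term identities can be checked on one set of numbers, since the rows $(x_i, y_i)$ of the first determinant may serve as the columns $a$, $b$, $c$, $d$ of the second. The values and helper names below are our own illustration.

```python
def d2(p, q):
    """Two-rowed determinant (pq) of the columns p and q."""
    return p[0] * q[1] - p[1] * q[0]


def minor(x, y, i, j):
    """(xy)_ij: the minor of columns x, y taken from rows i and j (1-indexed)."""
    return x[i - 1] * y[j - 1] - x[j - 1] * y[i - 1]


# Identity (27) for four columns of two elements each.
a, b, c, d = (2, 5), (-1, 3), (4, 7), (6, -2)
total_27 = d2(b, c) * d2(a, d) + d2(c, a) * d2(b, d) + d2(a, b) * d2(c, d)
print(total_27)  # 0

# Identity (26) for two columns of four elements each.
x, y = (2, -1, 4, 6), (5, 3, 7, -2)
total_26 = (minor(x, y, 1, 2) * minor(x, y, 3, 4)
            + minor(x, y, 1, 3) * minor(x, y, 4, 2)
            + minor(x, y, 1, 4) * minor(x, y, 2, 3))
print(total_26)  # 0
```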



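Several results follow from a determinant of the sixth order.
Consider the identity

$$\begin{vmatrix} a_1 & b_1 & c_1 & d_1 & \cdot & \cdot \\ a_2 & b_2 & c_2 & d_2 & \cdot & \cdot \\ a_3 & b_3 & c_3 & d_3 & \cdot & \cdot \\ \cdot & \cdot & \cdot & d_1 & e_1 & f_1 \\ \cdot & \cdot & \cdot & d_2 & e_2 & f_2 \\ \cdot & \cdot & \cdot & d_3 & e_3 & f_3 \end{vmatrix} = \begin{vmatrix} a_1 & b_1 & c_1 & d_1 & \cdot & \cdot \\ a_2 & b_2 & c_2 & d_2 & \cdot & \cdot \\ a_3 & b_3 & c_3 & d_3 & \cdot & \cdot \\ -a_1 & -b_1 & -c_1 & \cdot & e_1 & f_1 \\ -a_2 & -b_2 & -c_2 & \cdot & e_2 & f_2 \\ -a_3 & -b_3 & -c_3 & \cdot & e_3 & f_3 \end{vmatrix}.$$

This is comprised in the single operation: Subtract the upper from
the lower half matrix in the first determinant. Expanding by
Laplace's method we have

$$(abc)(def) = (dbc)(aef) + (adc)(bef) + (abd)(cef), \qquad (28)$$

where all suffixes are 1, 2, 3. All the other usual terms have
disappeared because of the zero columns in one or other factor.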
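The ternary identity (28), and the six-rowed determinant it comes from, can be confirmed with arbitrary integer columns. In this sketch the column values and the helper names `det`, `rows_from_columns` and `D` are our own.

```python
def det(m):
    """Determinant by expansion along the first row (exact for integers)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j in range(len(m)):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(minor)
    return total


def rows_from_columns(*columns):
    """Build a matrix (list of rows) from the given columns."""
    return [list(row) for row in zip(*columns)]


def D(*columns):
    """Determinant whose columns are the given vectors, e.g. D(a, b, c) = (abc)."""
    return det(rows_from_columns(*columns))


a, b, c = (1, 2, 0), (0, 1, 3), (2, -1, 1)
d, e, f = (1, 1, 1), (3, 0, -2), (-1, 4, 2)
zero = (0, 0, 0)

lhs = D(a, b, c) * D(d, e, f)
rhs = D(d, b, c) * D(a, e, f) + D(a, d, c) * D(b, e, f) + D(a, b, d) * D(c, e, f)
assert lhs == rhs

# The six-rowed determinant: upper half a b c d . . , lower half . . . d e f
big = rows_from_columns(a, b, c, d, zero, zero) + rows_from_columns(zero, zero, zero, d, e, f)
assert det(big) == lhs
```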

This is called a fundamental ternary identity in general
form. But in particular if $[d] = [f]$ (i.e. $d_1 = f_1$, $d_2 = f_2$,
$d_3 = f_3$) then $(def) = 0$ and

$$0 = (fbc)(aef) + (afc)(bef) + (abf)(cef).$$

Rearranged this is

$$0 = (bcf)(aef) + (caf)(bef) + (abf)(cef). \qquad (29)$$

This is an example of an extensional, for it reproduces at a higher
order an identity (27) already known, merely by inserting a
common letter $f$ in each factor.

The eighth order gives still more varied results by exactly
the same device. We expand the identity

$$\begin{vmatrix} a_1 & b_1 & c_1 & d_1 & e_1 & f_1 & \cdot & \cdot \\ \vdots & & & & & & & \vdots \\ a_4 & b_4 & c_4 & d_4 & e_4 & f_4 & \cdot & \cdot \\ \cdot & \cdot & \cdot & d_1 & e_1 & f_1 & g_1 & h_1 \\ \vdots & & & & & & & \vdots \\ \cdot & \cdot & \cdot & d_4 & e_4 & f_4 & g_4 & h_4 \end{vmatrix} = \begin{vmatrix} a_1 & b_1 & c_1 & d_1 & e_1 & f_1 & \cdot & \cdot \\ \vdots & & & & & & & \vdots \\ a_4 & b_4 & c_4 & d_4 & e_4 & f_4 & \cdot & \cdot \\ -a_1 & -b_1 & -c_1 & \cdot & \cdot & \cdot & g_1 & h_1 \\ \vdots & & & & & & & \vdots \\ -a_4 & -b_4 & -c_4 & \cdot & \cdot & \cdot & g_4 & h_4 \end{vmatrix}$$

and obtain

$$(abcd)(efgh) + (abce)(fdgh) + (abcf)(degh) = (adef)(bcgh) + (bdef)(cagh) + (cdef)(abgh). \qquad (30)$$

All other terms disappear because of zero columns. Now it is
useful to use the abbreviation as in §7, p. 27, for this last result.
It is written

$$(abc\dot d)(\dot e\dot f gh) = (\dot a def)(\dot b\dot c gh), \qquad (31)$$

where the three dots placed over the letters of a term indicate
the sum of three terms obtained by suitable derangement: in
one case

$$d,\ ef \qquad e,\ fd \qquad f,\ de,$$

and in the other,

$$a,\ bc \qquad b,\ ca \qquad c,\ ab.$$

Matrix Notation. 

As this eight-rowed determinant can teach us several further
interesting facts about four-rowed determinants, let us make a
natural abbreviation, writing

$$\begin{vmatrix} a & b & c & d & e & f & \cdot & \cdot \\ \cdot & \cdot & \cdot & d & e & f & g & h \end{vmatrix} = \begin{vmatrix} a & b & c & d & e & f & \cdot & \cdot \\ -a & -b & -c & \cdot & \cdot & \cdot & g & h \end{vmatrix}$$

as short for the above equality. In this last each letter stands for
a matrix of one column of four elements. This enables us to form
other possible relations by varying the number of repeated letters.
Thus from

$$\begin{vmatrix} a & b & c & d & e & \cdot & \cdot & \cdot \\ \cdot & \cdot & c & d & e & f & g & h \end{vmatrix} = \begin{vmatrix} a & b & c & d & e & \cdot & \cdot & \cdot \\ -a & -b & \cdot & \cdot & \cdot & f & g & h \end{vmatrix} \qquad (32)$$

it follows that

$$(abcd)(efgh) + (abde)(cfgh) + (abec)(dfgh) = (acde)(bfgh) - (bcde)(afgh), \qquad (33)$$

or, more succinctly,

$$(ab\dot c\dot d)(\dot e fgh) = (\dot a cde)(\dot b fgh). \qquad (34)$$

The five dots placed above letters, indicating determinantal
permutation as already explained, must, of course, not be confused
with the dots in (32) which stand for zeros. The two uses
happen to have arisen quite independently, like the use of
vertical lines | to denote a determinant or else a modulus of a
complex number. It is an interesting study to discover how
mathematics advances by the bold use of one symbol for two or
more distinct things, letting the context decide.

Comparing (32) and (34) a curious rule comes to light. The
dots used in the two statements nicely balance each other.
Letters $c$, $d$, $e$ in (32) are dotted in (34). This should facilitate
the proof of any such identity.

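Examples. Prove

$$(a\dot b\dot c)(\dot d\dot e f) = 0,$$

$$(a\dot b\dot c\dot d)(\dot e\dot f gh) = 0,$$

$$(\dot a\dot b cd)(\dot e\dot f gh) = (abef)(cdgh),$$

$$(\dot a\dot b cdu)(\dot e\dot f ghu) = (abefu)(cdghu).$$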

9.^ Fundamental Identities of Order n. 

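Let $i + j + k = n$, so that

$$(A_iB_jC_k)$$

denotes an $n$-rowed determinant whose columns are specified by

$$a_1, a_2, \dots, a_i,\ b_1, b_2, \dots, b_j,\ c_1, c_2, \dots, c_k$$

in this order. Let

$$(D_iE_jF_k)$$

be another such determinant whose columns are specified by

$$d_1, d_2, \dots, d_i,\ e_1, e_2, \dots, e_j,\ f_1, f_2, \dots, f_k.$$

We form the determinant of $2n$ rows

$$\begin{vmatrix} A_i & B_j & C_k & D_i & \cdot & \cdot \\ \cdot & \cdot & C_k & D_i & E_j & F_k \end{vmatrix}$$

or simply

$$\begin{vmatrix} A & B & C & D & \cdot & \cdot \\ \cdot & \cdot & C & D & E & F \end{vmatrix}$$

meaning exactly the same thing. Now we subtract the upper
$n$ rows from the lower, as before, so that

$$\begin{vmatrix} A & B & C & D & \cdot & \cdot \\ \cdot & \cdot & C & D & E & F \end{vmatrix} = \begin{vmatrix} A & B & C & D & \cdot & \cdot \\ -A & -B & \cdot & \cdot & E & F \end{vmatrix} = (-)^i \begin{vmatrix} D & B & C & A & \cdot & \cdot \\ \cdot & -B & \cdot & -A & E & F \end{vmatrix}$$

on interchanging $i$ columns $D$ with $i$ columns $A$. Expanding the
first and third of these by a Laplace development, as in §8, p. 42,
we obtain a sum of terms on each side of the identity; but
many of these terms vanish, because of zero columns. On the
left the surviving terms, $\binom{i+k}{i}$ in number, may be written

$$(A_iB_j\dot C_k)(\dot D_iE_jF_k).$$

On the right the upper row of capitals furnishes the first factor
and the lower the second factor of a term. Each second factor
has exactly $i$ negative columns. Hence the expansion is

$$(-)^i(-)^i\,(D_i\dot B_jC_k)(\dot A_iE_jF_k)$$

with $\binom{i+j}{i}$ terms. Equating these results we have the important
identity

$$(A_iB_j\dot C_k)(\dot D_iE_jF_k) = (D_i\dot B_jC_k)(\dot A_iE_jF_k). \qquad (I)$$

In particular if $j = 0$, $i + k = n$, we merely suppress the matrices
$B$ and $E$, so that

$$(A_i\dot C_k)(\dot D_iF_k) = (D_iC_k)(A_iF_k) \qquad (II)$$

with only one term on the right, and $\binom{n}{i}$ terms on the left.
Once more, from the equalities

$$\begin{vmatrix} A_i & B_j & C_k & D_i & E_j & \cdot \\ \cdot & B_j & C_k & D_i & E_j & F_k \end{vmatrix} = \begin{vmatrix} A & B & C & D & E & \cdot \\ \cdot & B & C & D & E & F \end{vmatrix} = \begin{vmatrix} A & B & C & D & E & \cdot \\ -A & \cdot & \cdot & \cdot & \cdot & F \end{vmatrix}$$

we have on expansion

$$(A_i\dot B_j\dot C_k)(\dot D_i\dot E_jF_k) = 0. \qquad (III)$$

* The rest of this chapter, excepting §12, may be omitted on a first reading.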


These three results, which are of great use in the invariant
theory, can be summed up in one statement:

A sum of products of two $n$-rowed determinants, formed by
determinantal permutation of $p$ columns of one with $q$ columns
of the other, is identically equal to a like sum, a single product, or
zero, according as $p + q <, =, > n$. All columns undergoing
derangement on one side of the identity are collected into one factor
on the other side, except in the third alternative case.

Letters which are so collected into one factor are said to be
convolved in that factor.
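The second and third cases of this statement can be tested numerically once determinantal permutation is written out as an explicit signed sum. The sketch below is our own reading of that operation: the dotted columns, taken in their written order, are split in every possible way between the two factors, each split carrying the sign of the resulting arrangement. The function names and sample columns are assumptions made for illustration.

```python
from itertools import combinations


def det(m):
    """Determinant by expansion along the first row (exact for integers)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j in range(len(m)):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(minor)
    return total


def D(columns):
    """Determinant whose columns are the given vectors."""
    return det([list(row) for row in zip(*columns)])


def split_sign(chosen, rest):
    """Sign of the arrangement chosen + rest relative to the natural order."""
    order = list(chosen) + list(rest)
    inversions = sum(1 for i in range(len(order))
                     for j in range(i + 1, len(order)) if order[i] > order[j])
    return -1 if inversions % 2 else 1


def dotted_sum(fixed1, dotted, k, fixed2):
    """Sum of sign * (fixed1, k dotted columns)(remaining dotted columns, fixed2)."""
    total = 0
    indices = range(len(dotted))
    for chosen in combinations(indices, k):
        rest = [i for i in indices if i not in chosen]
        first = D(fixed1 + [dotted[i] for i in chosen])
        second = D([dotted[i] for i in rest] + fixed2)
        total += split_sign(chosen, rest) * first * second
    return total


# Case p + q = n with n = 4: (A C.)(D. F) = (D C)(A F), each block of two columns.
A = [(1, 0, 2, 1), (0, 1, 1, 3)]
C = [(2, 1, 0, 0), (1, -1, 1, 2)]
Dm = [(0, 2, 1, 1), (3, 0, 1, -1)]
F = [(1, 1, 1, 0), (2, 0, -1, 1)]
assert dotted_sum(A, C + Dm, 2, F) == D(Dm + C) * D(A + F)

# Case p + q > n with n = 3: four dotted columns shared between two factors give zero.
a, b, c = (1, 2, 0), (0, 1, 3), (2, -1, 1)
d, e, f = (1, 1, 1), (3, 0, -2), (-1, 4, 2)
assert dotted_sum([a], [b, c, d, e], 2, [f]) == 0
```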

This theorem, which collects together results illustrated in
§8, has only comparatively lately been recognized, although
its simpler cases¹ when $n = 2, 3, 4$, &c., were studied from the
very outset of the determinant theory. Sylvester was the first
to establish it as in formula (II), but he did not arrive at the
other cases (I) and (III).

A particular case of (II) when $i = 1$ can be written

$$(a_1b_2b_3 \dots b_n)(b_1D) - (a_1b_1b_3 \dots b_n)(b_2D) + \dots = (b_1b_2 \dots b_n)(a_1D)$$

by substituting $a_1$ for $A_i$, $b_2b_3 \dots b_n$ for $C_k$, $b_1$ for $D_i$, and $D$ for $F_k$.

This falls in with the Cramer rule of substitutions if we rewrite it as

$$(a_1b_2b_3 \dots b_n)(b_1D) + (b_1a_1b_3 \dots b_n)(b_2D) + \dots + (b_1b_2 \dots b_{n-1}a_1)(b_nD) = (b_1b_2 \dots b_n)(a_1D). \qquad (IV)$$

This last result is closely connected with the easily proved identity
(V) given later, p. 50, where examples of its use will be found.

10. Implicit and Explicit Convolution. 

In the above identities certain matrices are unchanged for
each term. For instance, $A_iB_j$ on the left of (I) remains unpermuted
while $E_jF_k$ is unchanged on both sides of the identity.
It is useful to make the following distinction and to say that
$A_iB_j$ is explicitly convolved on the left and implicitly convolved²
on the right, while $E_jF_k$ is explicitly convolved throughout.
Similarly $C_kD_i$ is explicitly convolved on the right and implicitly
on the left.

¹ In 1779 Bézout gave several simple cases. Cf. Muir, History of
Determinants, Vol. I, p. 41.

² The importance of a word for this implicit convolution has begun to be
felt elsewhere. Such matrices are "herausgegriffenen". Weitzenböck,
Annalen, 97 (1927), 794.

These identities are given in the Trans, Cambridge Phil, Soc, XXI (1909), 
under identity B, p. 209, where incidentally there is an error in the sign, which 
should read ( — )•(« - and not ( — )««. Identity (IV) was formulated by Sylvester 
in 1839, and (II) in 1861 {Phil, Mag, (4), ii, 142-146). 





11. General Fundamental Identities of Order n. 

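For the purpose of this present section let us use the notation

$$A_iB_j = AB \qquad (35)$$

to mean adjacent $n$-rowed matrices within a determinant and
not the ordinary product of matrices. On shifting $i$ columns
of $A$ past $B$ we obtain

$$B_jA_i = BA = (-)^{ij}AB. \qquad (36)$$

In the last section we considered sums of products of two $n$-rowed
determinants. We now extend the results to products of $p$ such
determinants

$$(AL),\ (BM),\ (CN),\ \dots$$

composed of $n$-rowed matrices

$$A = A_i,\ L = L_{n-i},\ B = B_j,\ M = M_{n-j},\ C = C_k,\ N = N_{n-k},\ \dots \qquad (37)$$

where $i$, $j$, $k$, ... are $p$ positive integers each less than $n$.

We consider the following determinants $\Delta_1$, $\Delta_2$, ..., $\Delta_p$, of
orders $n$, $2n$, ..., $pn$ respectively:

$$\Delta_1 = (AL), \quad \Delta_2 = \begin{vmatrix} A & L & B & \cdot \\ A & \cdot & B & M \end{vmatrix}, \quad \Delta_3 = \begin{vmatrix} A & L & B & \cdot & C & \cdot \\ A & \cdot & B & M & C & \cdot \\ A & \cdot & B & \cdot & C & N \end{vmatrix}, \ \dots \qquad (38)$$

where each matrix $A$, $B$, $C$, ... is repeated, while each $L$, $M$, $N$, ...
occurs once. When expanded as in §8, p. 42 by a Laplace
development into determinants of order $n$, these become

$$\Delta_2 = (\dot AL)(\dot BM), \quad \Delta_3 = (\dot AL)(\dot BM)(\dot CN), \ \dots \qquad (39)$$

for again the zero columns prevent such matrices as $L$, $M$, $N$
from being permuted. Also by $\mathrm{row}_2 - \mathrm{row}_1$, $\mathrm{row}_3 - \mathrm{row}_1$, ...
in (38) we have

$$\Delta_2 = \begin{vmatrix} A & L & B & \cdot \\ A & \cdot & B & M \end{vmatrix} = \begin{vmatrix} A & L & B & \cdot \\ \cdot & -L & \cdot & M \end{vmatrix},$$

$$\Delta_3 = \begin{vmatrix} A & L & B & \cdot & C & \cdot \\ A & \cdot & B & M & C & \cdot \\ A & \cdot & B & \cdot & C & N \end{vmatrix} = \begin{vmatrix} A & L & B & \cdot & C & \cdot \\ \cdot & -L & \cdot & M & \cdot & \cdot \\ \cdot & -L & \cdot & \cdot & \cdot & N \end{vmatrix}, \qquad (40)$$

&c.

Again expanding by a Laplace development $\Delta_2$ leads to results
§9, (I), (II), (III), p. 45, already given, while $\Delta_3$, $\Delta_4$, ... give
analogous identities which may be stated as follows:

(i) $i + j + k + \dots = n - h < n$, and $L = L''_jL'''_k \dots L'_h$,
by partitioning the $n - i$ columns of $L$ into its first $j$, next
$k$, ..., and last $h$ columns. Then

$$(\dot AL)(\dot BM)(\dot CN) \dots = (ABC \dots \dot L')(\dot L''M)(\dot L'''N) \dots . \qquad (I)$$

(ii) $i + j + k + \dots = n$, and $L = L''_jL'''_k \dots$, then

$$(\dot AL)(\dot BM)(\dot CN) \dots = (ABC \dots)(\dot L''M)(\dot L'''N) \dots . \qquad (II)$$

(iii) $i + j + k + \dots > n$, then

$$(\dot AL)(\dot BM)(\dot CN) \dots = 0. \qquad (III)$$

The only difficulty here is to settle the sign on the right hand
of (I) and (II). The case of $\Delta_3$ for (I) suffices to justify the result.
Writing $L = L''L'''L'$ and interchanging the first $j$ columns $L''$
of $L$ with the $j$ columns of $B$, and the $k$ columns $L'''$ with $C$,
we have

$$\Delta_3 = (-)^{j+k} \begin{vmatrix} A & B & C & L' & L'' & \cdot & L''' & \cdot \\ \cdot & \cdot & \cdot & -L' & -L'' & M & -L''' & \cdot \\ \cdot & \cdot & \cdot & -L' & -L'' & \cdot & -L''' & N \end{vmatrix}.$$

Expanding and shifting all negative signs out of the factor
determinants we have

$$\Delta_3 = (-)^{2j+2k}(ABC\dot L')(\dot L''M)(\dot L'''N).$$

Similarly for $\Delta_4$, ..., $\Delta_p$.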

We can now enunciate these results in the following state- 
ment: 

A sum of products of $p$ determinants each with $n$ rows, formed
by determinantal permutation of $i$ columns of the first, $j$ columns
of the next, and so on, is identically equal to a sum of such products,
a single product, or zero, according as $i + j + \dots <, =, > n$.
All columns undergoing derangement on one side of
the identity are convolved in one factor on the other side, except
in the third alternative case.

Corollary I. If each of $i$, $j$, $k$, ... is unity, the corresponding
series on the left of the identity can be written as a $p$-rowed
compound determinant, namely one whose elements are determinants.

Thus, replacing $A$, $B$, $C$ by $a$, $b$, $c$, and $L$, $M$, $N$ by $\lambda$, $\mu$, $\nu$
we have

$$(\dot a\lambda)(\dot b\mu)(\dot c\nu) = \begin{vmatrix} (a\lambda) & (b\lambda) & (c\lambda) \\ (a\mu) & (b\mu) & (c\mu) \\ (a\nu) & (b\nu) & (c\nu) \end{vmatrix}$$

where $\lambda$, $\mu$, $\nu$ have currency $n - 1$.

It is useful to use Greek letters for these very important 
matrices involving n — 1 columns. 

Corollary II. — If L, M, N . . . have certain columns in common, 
these columns do not undergo derangement. For the consequent 
terms would all be zero. 

Corollary III. Each identity for $n$-rowed determinants can
be extended to $n + m$-rowed determinants, simply by affixing a
common matrix $\Theta$ of currency $m$ to each of $L$, $M$, $N$ ..., and
considering all matrices to have $n + m$ rows.

This is the principle of extensionals. Incidental examples have
already been given (§8, (29), p. 42).

Proof. Let $(\dot AL)(\dot BM) = (AB\dot L')(\dot L''M)$, $L = L''L'$, be
identity (I) for $n$-rowed determinants. Also let

$$(AL\Theta)(BM\Theta)$$

denote a product of two $n + m$-rowed determinants, where
$A$, $L$, $B$, $M$ have received $m$ new rows and $\Theta$ has $n + m$ rows
and $m$ columns. On permuting $A$, $B$ in this we have an identity

$$(\dot AL\Theta)(\dot BM\Theta) = \Sigma\,(AB \dots)(\dots M\Theta).$$

If a column of the $\Theta$ from the first factor is displaced, the consequent
term vanishes by corollary II. Thus $L$ alone is deranged
and

$$(\dot AL\Theta)(\dot BM\Theta) = (AB\dot L'\Theta)(\dot L''M\Theta).$$

Corollary IV. Each fundamental identity remains an identity
after any further determinantal permutation has been applied to
columns of $M$, $N$ ... and other arbitrary columns, excluding
those of $A$, $B$, $C$, ..., $L'$, $L''$, $L'''$, ... which are already implicitly
convolved.

For if the new operation has $t$ terms, the fundamental identity
is true for each such derangement, so that the sum of these $t$
identities is still an identity.

Example. Since $(\dot all')(\dot bmm') = (ab\dot l')(\dot lmm')$ we infer that

$$(\dot all')(\dot bmm')(n\lambda) - (\dot all')(\dot bmn)(m'\lambda) = (ab\dot l')(\dot lmm')(n\lambda) - (ab\dot l')(\dot lmn)(m'\lambda),$$

where a new operation $\dot m'$, $\dot n$ of two terms takes effect.

12. Linear Relation between n \- \ Linear Forms. 

The identity (IV) (p. 46) can also be proved as follows:

We form a vanishing determinant of order $n + 1$ from $n$
arbitrary rows followed by

$$\mathrm{row}_{n+1} = x_1\,\mathrm{row}_1 + x_2\,\mathrm{row}_2 + \dots + x_n\,\mathrm{row}_n.$$

Thus

$$\begin{vmatrix} b_{11} & b_{21} & \dots & b_{n1} & a_{11} \\ \vdots & & & & \vdots \\ b_{1n} & b_{2n} & \dots & b_{nn} & a_{1n} \\ b_{1x} & b_{2x} & \dots & b_{nx} & a_{1x} \end{vmatrix} = 0,$$

where $b_{ix} = b_{i1}x_1 + b_{i2}x_2 + \dots + b_{in}x_n$.

Expanding this by the last row, we have

$$(b_2b_3 \dots b_na_1)\,b_{1x} - (b_1b_3 \dots b_na_1)\,b_{2x} + \dots + (-)^n(b_1b_2 \dots b_n)\,a_{1x} = 0. \qquad (V)$$

If in particular $x_1, x_2, \dots, x_n$ are the $n$ determinants of the matrix
$D$, which consists of $n - 1$ columns, the result reverts to (IV).
Since the elements $a$, $b$, $x$ are all arbitrary, identity (V) gives the
important information formally proved in Chapter V, §1, that
Any $n + 1$ homogeneous linear forms in $n$ variables are necessarily
linearly related.

For $b_{1x}, \dots, a_{1x}$ are such forms, and (V) is such a linear relation,
provided not all the determinants appearing in (V) vanish.

EXAMPLES 

1. Prove by this method if $n = 2$,

$$(bc)\,a_x + (ca)\,b_x + (ab)\,c_x = 0,$$

$$(bc)(ad) + (ca)(bd) + (ab)(cd) = 0.$$

2. If $n = 3$,

$$(bcd)\,a_x + (cad)\,b_x + (abd)\,c_x = (abc)\,d_x,$$

$$(bcd)(aef) + (cad)(bef) + (abd)(cef) = (abc)(def).$$

3. Examine the identity (IV) when $D$ is a unit matrix prolonged by
zeros: as

$$D = \begin{bmatrix} 1 & \cdot \\ \cdot & 1 \\ \cdot & \cdot \end{bmatrix}, \quad n = 3.$$

4. Prove if $i = 1, 2, 3$,

$$(bcd)\,a_i + (cad)\,b_i + (abd)\,c_i = (abc)\,d_i,$$

and generalize this result.

5. Prove as a special case of the Sylvester identity (II),

$$(\dot a\dot b ef)(\dot c\dot d)_{12} = (abcd)(ef)_{12}, \qquad (\dot a\dot b ef)(\dot c\dot d)_{ij} = (abcd)(ef)_{ij}.$$

6. $(\dot a\dot b pqr)(\dot c\dot d)_{12} = (abcdr)(pq)_{12}$,

$(\dot a\dot b pqr)(\dot c\dot d)_{ij} = (abcdr)(pq)_{ij}$.

13 . Principle of Duality. . 

There is still something more to be learnt from these funda- 
mental identities. The identity (I) arising from a product of j) 
n-rowed determinants may in fact be looked on as the result of 
alternative Laplace expansions of the j?n-ro\ved determinant A^. 
But, as already remarked, this A^ gives rise to p -{- 1 equivalent 
expansions, according as we expand A^, directly, or after sub- 
tracting its first, second ... or pth matrix layer from all the 
p — 1 remaining layers. Thus if p = 3 , i + i + ^ we take 

A L B . G . 

A3= a . B M C , 

A . B . C N 

where the currencies of the six symbols in order are 

i, n —S, j, n.— j, hy n—k. 

Hence by shifting columns to bring ABC before LMN, we have 

A B C L . . 

A B C . M . 

ABC. . N 



( 41 ) 



52 


LINEAR PROPERTIES [Chap. 


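Writing $\epsilon_3$ to denote this power of negative signs, and $R$ for
$ABC$ of currency $i + j + k = n - h$, we have

$$\Delta_3 = \epsilon_3 \begin{vmatrix} R & L & \cdot & \cdot \\ R & \cdot & M & \cdot \\ R & \cdot & \cdot & N \end{vmatrix}, \qquad (42)$$

and in general

$$\Delta_p = \epsilon_p \begin{vmatrix} R & L & \cdot & \cdots \\ R & \cdot & M & \cdots \\ \cdots & \cdots & \cdots & \cdots \end{vmatrix}. \qquad (43)$$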


As in §7, p. 39, we may, but for sign, transform $\Delta_p$ so as to
bring any letter $L, M, N$, repeated $p$ times over into the first
column, with the other $p$ single letters, including $R$, in any order
diagonally. For this reason we write $\Delta_p$ as

$$(R_{n-h}\, L_{n-i}\, M_{n-j}\, N_{n-k} \ldots), \qquad (44)$$

or shortly $(RLMN \ldots)$. It can then be shown that the inversion
law of these symbols is exactly the same as for the n-rowed determinant

regarded as an expression in the $p + 1$ matrices of $n$ rows,
E, A, B, C, . . . , of currency indicated by the suffixes.

Manifestly 

= . (45) 

and correlatively it will shortly be proved that

$$(RLMN \ldots) = (-)^{hi}(LRMN \ldots). \qquad (46)$$

Sometimes we need to partition one of these matrices, and
therefore require a more explicit notation, namely, a full stop
between the several matrices. Thus if $R = ABC$,

$$\Delta_3 = (RLMN) = (ABC\,.\,LMN), \qquad (47)$$

which is also written with a vertical line instead of a stop,

$$(ABC \mid LMN).$$

On reference to (II) we have by expansion

$$(ABC \mid LMN) = \Sigma\,(AL)(BM)(CN). \qquad (48)$$





The currency suffixes are suppressed only when all the matrices
have expressly been defined. Otherwise they are necessary.

When several matrices in $(RLMN \ldots)$ are not denoted by
single letters, the full stops between are essential. Thus if
$a, b, c, \ldots$ are all of unit currency, denoting columns of four-rowed
determinants, we might have

$$(ab\,.\,cde\,.\,fgh) = \Sigma\,(acde)(bfgh) = (acde)(bfgh) - (bcde)(afgh), \qquad (49)$$

or again

$$(aa'a''\,.\,bb'b''\,.\,cc'c''\,.\,dd'd'') = \Sigma\,(abb'b'')(a'cc'c'')(a''dd'd''). \qquad (50)$$


Proof of the Law of Transposition.

A reference to the original definition of $\Delta_2, \Delta_3, \ldots, \Delta_p$ shows
that it is enough to prove the law for adjacent transposition of
$R, L$ and of $L, M$.

Taking the $L, M$ case first and using $\Delta_3$ for brevity, we have

$$\Delta_3 = (RLMN) = (ABC \mid LMN) = \Sigma\,(AL)(BM)(CN) = \Sigma\,(BM)(AL)(CN) \qquad (51)$$

by the commutative law of ordinary multiplication applied to
each term of this series. This yields

$$\Delta_3 = (BAC \mid MLN).$$

But by (45) the interchange of $AB$ induces $ij$ changes of sign
in $R$. Hence

$$\Delta_3 = (RLMN) = (-)^{ij}(RMLN). \qquad (52)$$

Next, for the transposition of $R, L$ we have

$$\Delta_3 = \Sigma\,(AL)(BM)(CN) = \Sigma\,(ABCL')(L''M)(L'''N)$$

by the fundamental identity (I), when $L = L''L'''L'$. Here the
currency of $L'$ is $h\ (= n - i - j - k)$, to give $n$ columns for
the first factor. Hence by shifting columns

$$\Delta_3 = (-)^{h(n-h)}\,\Sigma\,(L'ABC)(L''M)(L'''N).$$





Again, shifting the $h$ columns $L'$ past the $j + k$ columns
$L''L'''$, we have

$$\Delta_3 = (-)^{h(n-h) + h(j+k)}(L \mid RMN) = (-)^{hi}(LRMN)$$

by simplifying the sign index. For this index is

$$h(n - h) + h(j + k + \ldots) = h(n - h) + h(n - h - i) \equiv hi \pmod 2,$$

leading to the same result. This proves the law.

Definition of Formal Duality: Two $n$-rowed matrices of currency
$h$ and $n - h$ respectively are formal duals of each other for
the field of order $n$.

Formal duality is a relation between numbers of columns:
the elements therein may be quite arbitrary.

In the above investigation $A$ is formal dual of $L$, $B$ of $M$,
&c., and conversely. We extend the definition to include the
determinants given in (45) and (46) as formal duals of each
other, and accordingly sum this up with a more symmetrical
notation as follows:

Let $n$ be partitioned into $p + 1$ positive integers $i_1, i_2, \ldots, i_{p+1}$,

$$i_1 + i_2 + \ldots + i_{p+1} = n,$$

so that

$$(n - i_1) + (n - i_2) + \ldots + (n - i_{p+1}) = np.$$

Let $A, B, \ldots, K$ be $p + 1$ matrices of $n$ rows, of currencies
$i_1, i_2, \ldots$ respectively, and $L, M, N, \ldots, Q$ be $p + 1$ such
matrices of currencies $n - i_1, n - i_2, \ldots$. Then the $n$-rowed
determinant

$$(ABC \ldots K)$$

is formal dual of the $np$-rowed determinant

$$(LMN \ldots Q).$$

More expressly with currency suffixes inserted, these dual
determinants are

$$(A_{i_1} B_{i_2} \ldots K_{i_{p+1}}), \qquad (L_{n-i_1} M_{n-i_2} \ldots Q_{n-i_{p+1}}).$$

It is also convenient to have a term to describe what is in
fact the really important thing about these determinants, the
relation between their currencies.

Definition of Characteristic. The sets of integers

$$[i_1, i_2, \ldots, i_{p+1}], \qquad [n - i_1, n - i_2, \ldots, n - i_{p+1}]$$

are the characteristics of the respective determinants.

For example, in results (49) and (50) the characteristics of 
the left-hand members are (233) and (3333) respectively. Since 
they are concerned with the field of order 4, their formal duals 
are (211) and (1111). 
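
The passage from a characteristic to its formal dual is purely arithmetical, and can be sketched in a few lines of Python. The function name `dual_characteristic` is an illustrative assumption and not notation from the text; the only rule used is that each currency $i$ is replaced by $n - i$.

```python
def dual_characteristic(characteristic, n):
    """Return the formal dual: each currency i is replaced by n - i."""
    if any(not 0 < i < n for i in characteristic):
        raise ValueError("each currency must lie strictly between 0 and n")
    return tuple(n - i for i in characteristic)

# Characteristics of the left-hand members of (49) and (50), field of order 4.
dual_49 = dual_characteristic((2, 3, 3), 4)
dual_50 = dual_characteristic((3, 3, 3, 3), 4)
print(dual_49, dual_50)
```

In both cases the dual characteristic is a partition of $n = 4$, while the original one sums to $np$, in agreement with the definition of formal duality.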

These examples, as illustrations of the more general

$$(ABC \mid LMN) = \Sigma\,(AL)(BM)(CN),$$

are extremely useful. Taken in conjunction with the fundamental
identities, and especially (I), they readily lead to many
elegant results, particularly when special values are given to
the matrices as in the following examples of compound
determinants, which illustrate the characteristic $(n - 1, n - 1, \ldots, n - 1)$,
dual of $(11 \ldots 1)$ (cf. §11, Corollary I, p. 48).

Historical Note. The chief properties of such compounds
which have been recorded by Cauchy (1812), Bazin (1854),
Sylvester (1841), Whittaker (1915) are still comparatively unknown.
Perhaps this is due, especially in the far-reaching cases given by
Sylvester, to the rather cumbrous notation originally adopted.
It is hoped that the present notation will help the reader to grasp
the principles underlying these theorems.

Muir: History of Determinants, Vol. I; Vol. II, especially pp. 58-62, 108.
Whittaker: Proc. Edinburgh Math. Soc., 36 (1918), 107-115.
Turnbull: Proc. London Math. Soc., 2, 22 (1923), 503-607.
Ferrar: Proc. London Math. Soc., 2, 23 (1924).


EXAMPLES

1. Prove

$$\begin{vmatrix} a_1 & b_1 & l_1 & \cdot \\ a_2 & b_2 & l_2 & \cdot \\ a_1 & b_1 & \cdot & m_1 \\ a_2 & b_2 & \cdot & m_2 \end{vmatrix} = (bl)(am) - (al)(bm).$$

2. If each letter denotes a column in a three-rowed determinant, write
out in full the identity $(apq)(brs)(ctu) = (abc)(prs)(qtu)$. Prove

$$(abc)(bca)(cab) = (abc)^3.$$





3. If $\alpha = pq$, $\beta = rs$, $\gamma = tu$, prove

$$(\alpha\beta\gamma) = (prs)(qtu) - (qrs)(ptu)$$

= — (^M) + (m) ('Ttn), 

4. Let $\Delta = (abc\,.\,def\,.\,ghk\,.\,lmn)$ denote the formal dual of the
four-rowed determinant $(xyzt)$, each letter representing a column of four
elements. Prove

$$(adef)(bghk)(clmn), \qquad (dabc)(eghk)(flmn).$$

5. Prove

$$(abc\,.\,bcp\,.\,cqr\,.\,stu) = (abcp)(bcqr)(cstu).$$


6. Generalize result 5 for a product of $n - 1$ determinants expressed
as a compound determinant.

7. $(ayz)(bzx)(cxy) = (abc)(xyz)^2$.

8. $(ayzt)(xbzt)(xyct)(xyzd) = (abcd)(xyzt)^3$.

9. Generalize 8. This is Bazin's Theorem (cf. V, p. 108).

10. If $[ab \ldots]$ is the unit matrix, state the results 7 and 8.

Ans. Cauchy's theorem on the adjugate, §7, p. 67.

11. $(ayz\theta)(bzx\theta)(cxy\theta) = (abc\theta)(xyz\theta)^2$.

12. $(ayzt\theta)(xbzt\theta)(xyct\theta)(xyzd\theta) = (abcd\theta)(xyzt\theta)^3$.

13. Generalize 12. This is Sylvester's Theorem. It is the extensional
of Bazin's Theorem.

14. $(bcd\,.\,acd\,.\,xyt\,.\,xyz) = -(abcd)(cxyt)(dxyz)$.

15. $(bcde\,.\,acde\,.\,abde\,.\,xyzu\,.\,xyzt) = -(abcde)^2(dxyzu)(exyzt)$.

16. The generalization of 14 and 15 is Whittaker's Theorem.



CHAPTER IV 


Multiplication of Matrices and Determinants

1. Fundamental Laws of Algebra. 

We now come to the crucial distinction between matrices and 
ordinary numbers: they do not obey the same law of multipli- 
cation. To gain a clear picture of what is here involved we must 
first recall the four fundamental laws of algebra, on which the 
whole superstructure is based. These are: 

I. The associative law; 

II. The distributive law; 

III. The commutative law; 

IV. The division law. 

The first three lead to six primitive facts concerning ordinary
addition and multiplication of ordinary numbers:

I (i) $(a + b) + c = a + (b + c)$, (ii) $(a \times b) \times c = a \times (b \times c)$,
II (i) $a \times (b + c) = a \times b + a \times c$, (ii) $(a + b) \times c = a \times c + b \times c$,
III (i) $a + b = b + a$, (ii) $a \times b = b \times a$.

The first pair render the use of brackets unnecessary for continued
sums $a + b + c$, and products $abc$. The fourth law runs
as follows:

IV. If $a \times b = 0$, then either $a = 0$ or $b = 0$.

As Hölder¹ has shown, any algebraic identity concerned with
real or complex numbers can be deduced from the above laws
I, II, III. For instance,

$$(x + y)(x - y) = x^2 - y^2$$

is true, but could not be proved merely by the use of the first
five: it also requires the law $xy = yx$. On the other hand
$(x + y)(a + b) = xa + ya + xb + yb$ does not need the commutative
law. Now this sixth law is by far the most interesting
of them all. There can be little doubt that whoever first discovered
it, was led thereto by the orderly arrangement of $a$ things in
each of $b$ rows or of $b$ things in each of $a$ rows.

¹ Göttinger Nachrichten, 2 (1889), p. 34.


Again, these first six laws are not entirely independent, for II (ii) 
can be deduced from the others with the help of III (ii), as the 
reader can quickly prove. But curiously enough the reverse is 
not the case. The exclusion of the law $a \times b = b \times a$ actually
renders the two distributive laws independent.

These laws hold of other classes of things besides numbers
$a, b, c$ and other operations besides addition (+) and multiplication
(×). Also some but not all of the laws hold of still more
things and operations. For example, similar laws operate in an 
English sentence. The preceding sentence can be regarded as 
the sum of nine words, or of forty-seven letters, addition having 
its ordinary meaning. But if we define multiplication to mean 
the building of a sentence by putting words into their proper 
order, then such multiplication does not obey the commutative 
law. The sentences Brutus stabbed Caesar, and Caesar stabbed 
Brutus mean entirely different things. On the other hand the 
Latin translation of one of these sentences would obey the 
commutative law. 

About ninety years ago two great pioneers in higher algebra,
Hamilton¹ and Grassmann², initiated schemes of algebra in
which the symbols $a, b, c$ did not obey the commutative law of
multiplication, although they satisfied the other five I, II, III (i).
The accumulated experience of intervening years has amply justified
these daring departures from the traditional rule, for it
shows conclusively that between the first five laws and the other
two there is a profound cleavage. In other words, many algebraic
theorems can be proved without recourse to the commutative
law of multiplication, thereby opening for algebra entirely new
fields, including matrices, vectors, and those latest adumbrations
in physics, q-numbers, for the more obvious annexations.

¹ Dublin Transact., 17 (1837), 393. Lectures on Quaternions (1853).
² Ausdehnungslehre: Collected Papers, Vol. I (2).

In terms of the first three laws I-III let us speak of
Linear Associative Algebra,

subdivided into

A. Commutative. B. Non-commutative.

Ordinary (scalar) algebra is of type A: that of Hamilton, Grassmann,
and the algebra of matrices is type B. In type B, law
III (ii) breaks down. Were it not to break down there would
be no raison d'être of a matrix theory; for every theorem of
ordinary algebra would automatically hold for matrices, vectors
and the like.

Nevertheless it is a singular fact that the matrix whose very 
pattern forces on the eye the propriety of the commutative law 
should be among the first triumphantly to break it! 

2. The Law of Multiplication of Matrices. 

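This law, which Cayley invented and his successors have
approved, takes its rise in the theory of linear transformations.
Let us consider a simple case with two variables, before and after
transformation. If

$$\begin{aligned} x_1 &= a_1y_1 + b_1y_2, & y_1 &= p_1z_1 + p_2z_2, \\ x_2 &= a_2y_1 + b_2y_2, & y_2 &= q_1z_1 + q_2z_2, \end{aligned}$$

we have what is called a system of two linear equations transforming
$x_1, x_2$ to $y_1, y_2$, and again two equations transforming
$y_1, y_2$ to $z_1, z_2$. We write

$$A = \begin{bmatrix} a_1 & b_1 \\ a_2 & b_2 \end{bmatrix}, \quad B = \begin{bmatrix} p_1 & p_2 \\ q_1 & q_2 \end{bmatrix}, \quad X = \begin{bmatrix} x_1 \\ x_2 \end{bmatrix}, \quad Y = \begin{bmatrix} y_1 \\ y_2 \end{bmatrix}, \quad Z = \begin{bmatrix} z_1 \\ z_2 \end{bmatrix}.$$

$A$ and $B$ are called the matrices of the respective transformations:
$X, Y, Z$ are the matrices of the variables. The two transformations
are sometimes written as

X–Y, Y–Z.

But by eliminating $y_1, y_2$ we have at once

$$\begin{aligned} x_1 &= (a_1p_1 + b_1q_1)z_1 + (a_1p_2 + b_1q_2)z_2, \\ x_2 &= (a_2p_1 + b_2q_1)z_1 + (a_2p_2 + b_2q_2)z_2, \end{aligned}$$

which is manifestly a linear transformation from $x_1, x_2$ direct to
$z_1, z_2$. If $C$ denote its coefficient matrix, then

$$C = \begin{bmatrix} a_1p_1 + b_1q_1 & a_1p_2 + b_1q_2 \\ a_2p_1 + b_2q_1 & a_2p_2 + b_2q_2 \end{bmatrix}.$$

Here we have the suggestion of a product of matrices. In fact $C$
is defined to be the product of $A$ and $B$, namely

$$\begin{bmatrix} a_1 & b_1 \\ a_2 & b_2 \end{bmatrix} \begin{bmatrix} p_1 & p_2 \\ q_1 & q_2 \end{bmatrix} = \begin{bmatrix} a_1p_1 + b_1q_1 & a_1p_2 + b_1q_2 \\ a_2p_1 + b_2q_1 & a_2p_2 + b_2q_2 \end{bmatrix}.$$

In short $A \times B = AB = C$.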

Thus the product of matrices is based on that of linear
transformations, and we might, for example, indicate this by

(X–Y)(Y–Z) = (X–Z),

meaning, the transformation from $X$ to $Y$ followed by that from
$Y$ to $Z$ produces the direct transformation from $X$ to $Z$. And
there is no harm even in speaking of the transformation $A$, meaning
that from $X$ to $Y$ whose coefficient matrix is $A$.

Now it is at once apparent from the definition that the product
$BA$ in general means something different from $AB$. Thus

$$BA = \begin{bmatrix} p_1 & p_2 \\ q_1 & q_2 \end{bmatrix} \begin{bmatrix} a_1 & b_1 \\ a_2 & b_2 \end{bmatrix} = \begin{bmatrix} a_1p_1 + a_2p_2 & b_1p_1 + b_2p_2 \\ a_1q_1 + a_2q_2 & b_1q_1 + b_2q_2 \end{bmatrix} = D,$$

which is only equal to $AB$ if all four corresponding elements
in $D$ equal those in $C$: e.g. $a_1p_1 + b_1q_1 = a_1p_1 + a_2p_2$.
Obviously this is not true in general since the eight elements
$a_1, \ldots, q_2$ are arbitrary numbers.
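
A numerical instance may make both points concrete: that $C = AB$ carries $Z$ directly to $X$, and that $BA$ differs from $AB$. The following is a minimal Python sketch; the particular numbers and the helper names `matmul2` and `apply` are illustrative assumptions, not taken from the text.

```python
def matmul2(A, B):
    """Product of two 2x2 matrices: rows of A woven into columns of B."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def apply(A, v):
    """Apply the linear transformation with matrix A to the column v."""
    return [A[0][0] * v[0] + A[0][1] * v[1],
            A[1][0] * v[0] + A[1][1] * v[1]]

A = [[1, 2], [3, 4]]   # X = A Y
B = [[0, 1], [5, 6]]   # Y = B Z
z = [7, -2]

C = matmul2(A, B)                    # direct transformation from Z to X
D = matmul2(B, A)                    # the product in the reverse order
x_direct = apply(C, z)               # X = (AB) Z
x_two_steps = apply(A, apply(B, z))  # X = A (B Z)
print(C, D, x_direct, x_two_steps)
```

Here `C` and `D` differ in every element, while the two routes from `z` to `x` agree.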

Next, we extend this definition to include other than square
matrices, by analogy from the case

$$\begin{bmatrix} a_1 & b_1 \\ a_2 & b_2 \end{bmatrix} \begin{bmatrix} y_1 & 0 \\ y_2 & 0 \end{bmatrix} = \begin{bmatrix} a_1y_1 + b_1y_2 & 0 \\ a_2y_1 + b_2y_2 & 0 \end{bmatrix}.$$

We define

$$AY = \begin{bmatrix} a_1 & b_1 \\ a_2 & b_2 \end{bmatrix} \begin{bmatrix} y_1 \\ y_2 \end{bmatrix} = \begin{bmatrix} a_1y_1 + b_1y_2 \\ a_2y_1 + b_2y_2 \end{bmatrix} = \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} = X.$$

Hence the formula $X = AY$ literally represents the linear
transformation of a column of variables $x_1, x_2$ to $y_1, y_2$. Similarly

$$Y = BZ, \quad X = CZ, \quad X = (AB)Z, \quad X = A(BZ).$$

This last suggests that for any conformable square matrices the
associative law holds, which is in fact the case; namely

$$P(QR) = (PQ)R,$$

which will be proved in §4, p. 63.

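EXAMPLES

1. If $I = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$, $O = \begin{bmatrix} 0 & 0 \\ 0 & 0 \end{bmatrix}$, and $A, B, C$ are arbitrary square
matrices of four elements each, prove

(i) $IA = AI = A$,

(ii) $OA = AO = O$,

(iii) $A(B + C) = AB + AC$,

(iv) $(A + B)C = AC + BC$.

2. If $x, y$ are scalar numbers, $A(xI) = xAI = xA$. Also $A(xB) = (xA)B$,
$xAyB = yxAB$.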

3. Product of Square Matrices of Order n. 

The general case is now straightforward. Let $A, B$ be two
such matrices of order $n$,

$$A = \begin{bmatrix} a_1 & a_2 & \ldots & a_n \\ b_1 & b_2 & \ldots & b_n \\ \ldots & \ldots & \ldots & \ldots \end{bmatrix}, \qquad B = \begin{bmatrix} p_1 & q_1 & \ldots \\ p_2 & q_2 & \ldots \\ \ldots & \ldots & \ldots \\ p_n & q_n & \ldots \end{bmatrix}.$$

Let the $i$th row of $A$ be denoted by $r_1, r_2, \ldots, r_n$, and the $j$th
column of $B$ by $c_1, c_2, \ldots, c_n$. We form a new matrix $C$ by
weaving this $i$th row of $A$ into the $j$th column of $B$ by the following
rule which defines what is called the inner product $(r \mid c)$ of
these two sets of $n$ elements

$$(r \mid c) = r_1c_1 + r_2c_2 + \ldots + r_nc_n.$$

This sum is taken for the $(i, j)$th element of $C$; and by choosing
all values $1, 2, \ldots, n$ for $i$ and for $j$ we completely determine
the elements of $C$. Evidently this rule includes the case
already cited when $n = 2$. Also the notation $(r \mid c)$ is entirely in
agreement with that of §13 (48), p. 52, as will be apparent later
in Chap. V, p. 81.

This notation $(r \mid c)$, sometimes written $r_c$ for the inner product,
is admirable for exhibiting the product of two matrices.
For instance,

$$\begin{bmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{bmatrix} \times \begin{bmatrix} p_1 & q_1 & r_1 \\ p_2 & q_2 & r_2 \\ p_3 & q_3 & r_3 \end{bmatrix} = \begin{bmatrix} (a \mid p) & (a \mid q) & (a \mid r) \\ (b \mid p) & (b \mid q) & (b \mid r) \\ (c \mid p) & (c \mid q) & (c \mid r) \end{bmatrix}.$$


Product of Rectangular Matrices. 

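The definition may easily be extended to the product of
rectangular matrices. It is only necessary for the fore and after
factors to have an equal length of row and column respectively.
Thus for a length of three terms

$$\begin{bmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \end{bmatrix} \begin{bmatrix} p_1 & q_1 \\ p_2 & q_2 \\ p_3 & q_3 \end{bmatrix} = \begin{bmatrix} (a \mid p) & (a \mid q) \\ (b \mid p) & (b \mid q) \end{bmatrix},$$

and the extreme instance is the single row with the single column

$$\begin{bmatrix} r_1 & r_2 & \ldots & r_n \end{bmatrix} \begin{bmatrix} c_1 \\ c_2 \\ \ldots \\ c_n \end{bmatrix} = r_1c_1 + \ldots + r_nc_n = (r \mid c).$$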
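
The rule of weaving rows into columns translates almost word for word into code. The sketch below is an illustration only, with matrices held as lists of rows; the names `inner` and `matmul` and the sample numbers are assumptions made for the example.

```python
def inner(r, c):
    """Inner product (r | c) of a row and a column of equal length."""
    if len(r) != len(c):
        raise ValueError("row and column must have equal length")
    return sum(ri * ci for ri, ci in zip(r, c))

def matmul(A, B):
    """Element (i, j) of the product is (row i of A | column j of B)."""
    columns = list(zip(*B))
    return [[inner(row, col) for col in columns] for row in A]

A = [[1, 2, 3], [4, 5, 6]]       # two rows of length three
B = [[1, 0], [0, 1], [2, 2]]     # two columns of length three

AB = matmul(A, B)                                # a 2 x 2 matrix
BA = matmul(B, A)                                # a 3 x 3 matrix
row_by_column = matmul([[1, 2, 3]], [[4], [5], [6]])
print(AB, BA, row_by_column)
```

The same function covers square matrices, rectangular matrices, and the extreme case of a single row with a single column, since only the common length of row and column matters.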


EXAMPLES 


r 1 

2 3 

1 ^ 2 -| 

2 

] 3 

X 0 1 1 

1 

0 2 

12 0 L 


2. Reverse the order in the above. 





3. Form the square and the cube of the matrix A =-■ 


■- 11-1 
- 32-1 
-3 1 

1 . 


n 


Ans. $A^3$ is the unit matrix (§8).

4. Prove that if $[m, r]$ denote an $m \times r$ matrix, $[m, r] \times [r, n] = [m, n]$.

5. If $X$ denote a column of $n$ variables $x_1, \ldots, x_n$, and if $Y, Z$ have
similar meanings for $y_i, z_i$, generalize the results of linear transformation
already given when $n = 2$. Namely, $X = AY$, $Y = BZ$, $X = CZ = ABZ$.


4. Double Suffix Notation of Multiplication.

It is here that the double suffix notation also is very useful.
Let

$$A = [a_{ij}], \qquad B = [b_{ij}]$$

be two square matrices of order $n$, so that $a_{ij}$ is the element in
the $i$th row and $j$th column. Then the inner product of the
$i$th row of $A$ and $k$th column of $B$ is

$$a_{i1}b_{1k} + a_{i2}b_{2k} + \ldots + a_{in}b_{nk} = (a_i \mid b_k).$$

Thus if $C = AB$, then

$$c_{ik} = \sum_j a_{ij}b_{jk} = (a_i \mid b_k). \qquad (1)$$

The notation lends itself to continued products. So we should
have

$$x_{il} = \sum_j \sum_k a_{ij}b_{jk}c_{kl}$$

in the case when

$$X = ABC. \qquad (2)$$

It will be seen that the suffixes on the right are linked in
pairs, with unpaired end suffixes $i, l$ answering to those of
$x_{il}$ on the left. The double series has $n^2$ terms; and since we obtain
exactly the same result whether we sum for $k$ first and for $j$ next,
or vice versa, we are justified in dropping the brackets in $A(BC)$
and $(AB)C$. They mean the same thing.
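
The argument can be tried numerically: the double sum of (2) is evaluated once, and compared with the two bracketed products. This is a small Python sketch under assumed sample matrices; the names `matmul` and `triple_sum` are illustrative.

```python
from itertools import product

def matmul(P, Q):
    """c_ik = sum over j of p_ij * q_jk."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
            for row in P]

def triple_sum(A, B, C):
    """x_il = double sum over j and k of a_ij * b_jk * c_kl."""
    n = len(A)
    return [[sum(A[i][j] * B[j][k] * C[k][l]
                 for j, k in product(range(n), repeat=2))
             for l in range(n)]
            for i in range(n)]

A = [[1, 2], [3, 4]]
B = [[0, 1], [1, 1]]
C = [[2, 0], [1, 3]]

X = triple_sum(A, B, C)
left_first = matmul(matmul(A, B), C)    # (AB)C
right_first = matmul(A, matmul(B, C))   # A(BC)
print(X, left_first, right_first)
```

All three results coincide, since each is the same finite sum of $n^2$ terms taken in a different order.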





5. The Division Law. 

The law IV (p. 57), that if $xy = 0$ then either $x$ or $y$ must
be zero does not hold for matrices. For example, let

$$A = \begin{bmatrix} a & b \\ 0 & 0 \end{bmatrix}, \qquad B = \begin{bmatrix} b & b \\ -a & -a \end{bmatrix};$$

then

$$AB = \begin{bmatrix} a & b \\ 0 & 0 \end{bmatrix} \begin{bmatrix} b & b \\ -a & -a \end{bmatrix} = \begin{bmatrix} 0 & 0 \\ 0 & 0 \end{bmatrix} = 0.$$

Yet neither factor $A$ nor $B$ is the zero or null matrix. Consequently
this law fails for $A$ and $B$.

At the same time the law holds for certain matrices; for
instance, if the determinant $|B|$ were non-zero it could be proved
that $AB = 0$ only if $A = 0$.
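
The example can be checked with particular numbers. The following Python sketch takes $a = 2$, $b = 3$ (an arbitrary choice made for illustration) and also evaluates the determinant of $B$, which must vanish if the remark about non-zero $|B|$ is to be consistent with the example.

```python
def matmul(P, Q):
    """Product of two matrices given as lists of rows."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
            for row in P]

a, b = 2, 3
A = [[a, b], [0, 0]]
B = [[b, b], [-a, -a]]
ZERO = [[0, 0], [0, 0]]

AB = matmul(A, B)
BA = matmul(B, A)
det_B = B[0][0] * B[1][1] - B[0][1] * B[1][0]
print(AB, BA, det_B)
```

The product `AB` is the null matrix although neither factor is null, `det_B` is zero, and the reversed product `BA` is not null.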

The reason why this is called the division law rests on the fact
that it defines a unique process converse to multiplication. For
suppose $AB = C$ and $AD = C$ are two instances of multiplication
in linear associative algebra which give the same product
$C$. Then $A(B - D) = AB - AD = C - C = 0$; so that if
$A \ne 0$ the law shows that $B - D = 0$ or $B = D$. Hence
there is only one possible factor following a given $A$ in a product
$AB$ which can produce a given $C$.

In this case $A$ is called the left or fore factor and $B$ the right
or after factor of $C$. Looking at the same relation conversely we
consider $B$ to be the quotient when $C$ is divided by $A$; more
precisely, $B$ is the quotient of left or fore division of $C$ by $A$.

Similarly, if both $AB = C$, $EB = C$ it follows from the same
law that $A = E$. Hence there is only one factor $A$ preceding a
given $B$ in a product $AB$ which can produce a given $C$. So
conversely $A$ is the quotient of right or after division of $C$ by $B$.

To sum up, there are in the case where $AB, BA$ are not
necessarily equal, two sorts of multiplication, fore and after
multiplication, and two analogous sorts of division. But these
last require the division law to hold. Algebra for which this law
is true is called division algebra.¹

¹ The reader who wishes to study this very important law should consult
Dickson, Linear Algebras (Cambridge, 1914), particularly the theorem of p. 10,
due to Frobenius, Crelle, 84 (1878), p. 59; also Dickson, Algebras and their
Arithmetics (Chicago, 1923) and the revised and German edition, Algebren und
ihre Zahlentheorie (Zurich, 1927).





EXAMPLES

1. A matrix product with the zero matrix for factor is zero.

[Note that two proofs are necessary, for left and right factor.]

2. If $ad \ne bc$, $xt \ne yz$, then the product $\begin{bmatrix} a & b \\ c & d \end{bmatrix} \begin{bmatrix} x & y \\ z & t \end{bmatrix}$ cannot be zero.

3. The first five laws always hold and the sixth sometimes holds for
matrices.

Consider ^ = [3 4 ]. ^ = [_3 ” 1 ]. "= = - 2A 

We consider this sixth law further in §§8 and 9 below.


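6. Products of Determinants.

The product of two determinants each of order n is itself a
determinant of the same order.

Consider, for example, the product of $(abc)(\alpha\beta\gamma)$. It can be
written as a six-rowed determinant:

$$\Delta = \begin{vmatrix} a_1 & b_1 & c_1 & -1 & \cdot & \cdot \\ a_2 & b_2 & c_2 & \cdot & -1 & \cdot \\ a_3 & b_3 & c_3 & \cdot & \cdot & -1 \\ \cdot & \cdot & \cdot & \alpha_1 & \alpha_2 & \alpha_3 \\ \cdot & \cdot & \cdot & \beta_1 & \beta_2 & \beta_3 \\ \cdot & \cdot & \cdot & \gamma_1 & \gamma_2 & \gamma_3 \end{vmatrix},$$

since the Laplace development contains only the term $(abc)(\alpha\beta\gamma)$.

Replace in $\Delta$ rows four, five, and six by the following equivalents:

$$\begin{aligned} &\mathrm{row}_4 + \alpha_1\,\mathrm{row}_1 + \alpha_2\,\mathrm{row}_2 + \alpha_3\,\mathrm{row}_3, \\ &\mathrm{row}_5 + \beta_1\,\mathrm{row}_1 + \beta_2\,\mathrm{row}_2 + \beta_3\,\mathrm{row}_3, \\ &\mathrm{row}_6 + \gamma_1\,\mathrm{row}_1 + \gamma_2\,\mathrm{row}_2 + \gamma_3\,\mathrm{row}_3. \end{aligned}$$

Hence

$$\Delta = \begin{vmatrix} a_1 & b_1 & c_1 & -1 & \cdot & \cdot \\ a_2 & b_2 & c_2 & \cdot & -1 & \cdot \\ a_3 & b_3 & c_3 & \cdot & \cdot & -1 \\ a_1\alpha_1 + a_2\alpha_2 + a_3\alpha_3 & b_1\alpha_1 + b_2\alpha_2 + b_3\alpha_3 & c_1\alpha_1 + c_2\alpha_2 + c_3\alpha_3 & \cdot & \cdot & \cdot \\ a_1\beta_1 + a_2\beta_2 + a_3\beta_3 & b_1\beta_1 + b_2\beta_2 + b_3\beta_3 & c_1\beta_1 + c_2\beta_2 + c_3\beta_3 & \cdot & \cdot & \cdot \\ a_1\gamma_1 + a_2\gamma_2 + a_3\gamma_3 & b_1\gamma_1 + b_2\gamma_2 + b_3\gamma_3 & c_1\gamma_1 + c_2\gamma_2 + c_3\gamma_3 & \cdot & \cdot & \cdot \end{vmatrix}.$$

Again expand by the Laplace development and we obtain

$$\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} \times \begin{vmatrix} \alpha_1 & \alpha_2 & \alpha_3 \\ \beta_1 & \beta_2 & \beta_3 \\ \gamma_1 & \gamma_2 & \gamma_3 \end{vmatrix} = (-)^3 \begin{vmatrix} -1 & \cdot & \cdot \\ \cdot & -1 & \cdot \\ \cdot & \cdot & -1 \end{vmatrix} \begin{vmatrix} (a \mid \alpha) & (b \mid \alpha) & (c \mid \alpha) \\ (a \mid \beta) & (b \mid \beta) & (c \mid \beta) \\ (a \mid \gamma) & (b \mid \gamma) & (c \mid \gamma) \end{vmatrix}.$$

The final determinant is called the product by columns and rows.
It is often written with $a_\alpha$ replacing the notation $(a \mid \alpha)$ for
$a_1\alpha_1 + a_2\alpha_2 + a_3\alpha_3$,
which gives the following product by rows and columns:

$$(\alpha\beta\gamma)(abc) = \begin{vmatrix} a_\alpha & b_\alpha & c_\alpha \\ a_\beta & b_\beta & c_\beta \\ a_\gamma & b_\gamma & c_\gamma \end{vmatrix}. \qquad (3)$$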


Corollary. The product rule of matrices agrees with that
of determinants. Thus if

$$AB = C,$$

then $|A|\,|B| = |C|$.

The converse is not true; for owing to the great variety of patterns
of the same determinant

$$\Delta = \Sigma \pm a_1b_2c_3 \ldots$$

made by interchanging rows and columns without disturbing
the actual contents of a row or column, there are many equivalent
product determinants. This is by no means true of matrices.
Herein is the manifest difference between matrices and determinants,
as interchange of rows and columns gives a different
multiplication rule valid for determinants but not for matrices.
But undoubtedly the rule as given in (3), of weaving columns
into rows, thus forming all possible inner products of a row of
the first factor and a column of the second factor, is the best to
store in the memory. But occasionally it is useful to multiply
rows by rows, or columns by columns.
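
The corollary, and the remark that determinants (unlike matrices) may also be multiplied rows by rows, can both be tried on a numerical case. The Python sketch below uses a plain Laplace expansion for the determinant; the matrices and the helper names `det`, `matmul` and `transpose` are assumptions chosen for illustration.

```python
def det(M):
    """Determinant by Laplace expansion along the first row."""
    if len(M) == 1:
        return M[0][0]
    total = 0
    for j, x in enumerate(M[0]):
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * x * det(minor)
    return total

def matmul(P, Q):
    """Rows of P woven into columns of Q."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
            for row in P]

def transpose(P):
    return [list(col) for col in zip(*P)]

A = [[1, 2, 0], [0, 1, 3], [4, 0, 1]]
B = [[2, 0, 1], [1, 1, 0], [0, 3, 1]]

rows_into_columns = matmul(A, B)             # the matrix product AB
rows_into_rows = matmul(A, transpose(B))     # rows of A with rows of B
print(det(A), det(B), det(rows_into_columns), det(rows_into_rows))
```

Both arrays have determinant $|A|\,|B|$, yet as matrices they are different, which is the distinction the text draws.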


7. Reciprocal and Adjugate Determinants. 

Let us now use capital letters to denote co-factors of elements
in a determinant

$$\Delta = |a_1b_2c_3 \ldots m_n|,$$

so that

$$\Delta = a_1A_1 + a_2A_2 + \ldots + a_nA_n = a_A$$

in the notation for an inner product just explained.

Since $|b_1b_2c_3 \ldots m_n|$ has two equal columns, it vanishes.
Thus

$$0 = b_1A_1 + b_2A_2 + \ldots + b_nA_n,$$

which may be written $b_A = 0$.

In general = 

for exactly the same reasons. Correlatively we have 

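$$\Delta = a_1A_1 + b_1B_1 + \ldots + m_1M_1,$$

which we typify by $\Sigma\, a_1A_1$, and

$$0 = a_2A_1 + b_2B_1 + \ldots + m_2M_1,$$

which we typify by $\Sigma\, a_2A_1$. So in general

$$\Sigma\, a_iA_j = 0, \quad i \ne j; \qquad = \Delta, \quad i = j = 1, 2, \ldots, n.$$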

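Definition. The determinant

$$\begin{vmatrix} A_1 & B_1 & C_1 & \ldots \\ A_2 & B_2 & C_2 & \ldots \\ \ldots & \ldots & \ldots & \ldots \\ A_n & B_n & C_n & \ldots \end{vmatrix}$$

is called the adjugate of $\Delta$. Its elements are the co-factors of the
corresponding elements of $\Delta$.

Since by multiplication

$$\Delta \times |A_1B_2C_3 \ldots M_n| = \begin{vmatrix} \Delta & 0 & 0 & \ldots & 0 \\ 0 & \Delta & 0 & \ldots & 0 \\ 0 & 0 & \Delta & \ldots & 0 \\ \ldots & \ldots & \ldots & \ldots & \ldots \\ 0 & 0 & 0 & \ldots & \Delta \end{vmatrix} = \Delta^n,$$

it follows that provided $\Delta \ne 0$,

$$|A_1B_2C_3 \ldots M_n| = \Delta^{n-1},$$

a very beautiful property, due to Cauchy.¹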
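
Cauchy's property is easily tested on a particular determinant. In the Python sketch below the adjugate is formed as a matrix whose $(i, j)$th element is the co-factor of the element in row $j$ and column $i$; this transposed arrangement, like the helper names, is an assumption made for the example, and does not affect the value of the determinant. The helpers are meant for order two or higher.

```python
def minor(M, i, j):
    """Delete row i and column j."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(M) if k != i]

def det(M):
    """Determinant by Laplace expansion along the first row."""
    if len(M) == 1:
        return M[0][0]
    return sum((-1) ** j * M[0][j] * det(minor(M, 0, j))
               for j in range(len(M)))

def adjugate(M):
    """Entry (i, j) is the co-factor of the element in row j, column i."""
    n = len(M)
    return [[(-1) ** (i + j) * det(minor(M, j, i)) for j in range(n)]
            for i in range(n)]

def matmul(P, Q):
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
            for row in P]

M = [[1, 2, 0], [0, 1, 3], [4, 0, 1]]   # determinant 25, order n = 3
S = [[1, 2], [2, 4]]                    # a singular matrix

product_with_adjugate = matmul(M, adjugate(M))
singular_product = matmul(S, adjugate(S))
print(det(M), det(adjugate(M)), product_with_adjugate, singular_product)
```

The product of `M` with its adjugate has $\Delta$ down the leading diagonal and zeros elsewhere, the determinant of the adjugate is $\Delta^{2}$, and for the singular matrix the product is null.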

Definition. The determinant whose elements are those of the
adjugate each divided by $\Delta$ is the reciprocal of $\Delta$.

¹ Journal de l'École Polytechnique, 17 (1815), 82.







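It can be written as $\Delta^{-1}$ or $\dfrac{1}{\Delta}$, since

$$\begin{vmatrix} \dfrac{A_1}{\Delta} & \dfrac{B_1}{\Delta} & \ldots \\ \dfrac{A_2}{\Delta} & \dfrac{B_2}{\Delta} & \ldots \\ \ldots & \ldots & \ldots \end{vmatrix} = \Delta^{-n}\,|A_1B_2C_3 \ldots M_n| = \frac{1}{\Delta}.$$

We have multiplied each row of the determinant by $\Delta$, at
the same time dividing the result by $\Delta^n$. This is an example
where the matrix theory would differ. Thus by matrix definition

$$\begin{bmatrix} \dfrac{A_1}{\Delta} & \dfrac{B_1}{\Delta} \\ \dfrac{A_2}{\Delta} & \dfrac{B_2}{\Delta} \end{bmatrix} = \frac{1}{\Delta} \begin{bmatrix} A_1 & B_1 \\ A_2 & B_2 \end{bmatrix},$$

where $\dfrac{1}{\Delta}$ is now the factor instead of $\Delta^{-2}$.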


8. The Index Law and the Reversal Law of a Matrix. 

There is no ambiguity in writing $A^2$ for the product of a
square matrix $A$ with itself. It follows by the associative law
that if $r$ is a positive integer, $A^r$ is a useful abbreviation for the
continued product of $r$ equal matrices $A$. Further, as in ordinary
algebra, the index law

$$A^rA^s = A^{r+s}$$

will hold for all positive integral values of $r$ and $s$. And, if
$|A| \ne 0$, we may allow $r, s$ to be any integers, zero and
negative included, by adding the following two definitions, for
the unit matrix and the reciprocal matrix.

Definition of Unit Matrix. The square matrix whose leading
diagonal elements are each unity, all other elements being zero, is
called the unit matrix.

Definition of Reciprocal Matrix. The square matrix $[\alpha_{ij}]$
whose $(i, j)$th element is the co-factor of $a_{ji}$ in the determinant $|a_{ij}|$,
divided by the determinant itself, is called the reciprocal of the
matrix $[a_{ij}]$.





The reasons for these definitions are simple, for they are
analogous to those of elementary algebra. Thus, if $I$ is the unit
matrix and $M$ is a square matrix of the same order $n$, then,
taking $n = 3$,

$$I = \begin{bmatrix} 1 & \cdot & \cdot \\ \cdot & 1 & \cdot \\ \cdot & \cdot & 1 \end{bmatrix}, \qquad M = \begin{bmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{bmatrix},$$

whence by actual multiplication

$$IM = M = MI,$$

so that the unit matrix as a factor leaves another matrix unchanged.
Indeed the unit matrix is seen to be commutative with
another matrix.

Also if $N$ is the reciprocal of $M$, then by definition

$$N = \begin{bmatrix} \dfrac{A_1}{\Delta} & \dfrac{A_2}{\Delta} & \dfrac{A_3}{\Delta} \\ \dfrac{B_1}{\Delta} & \dfrac{B_2}{\Delta} & \dfrac{B_3}{\Delta} \\ \dfrac{C_1}{\Delta} & \dfrac{C_2}{\Delta} & \dfrac{C_3}{\Delta} \end{bmatrix},$$

where $A_1$ is co-factor of $a_1$ in $\Delta$, the determinant of $M$. Thus by
actual multiplication

$$NM = \begin{bmatrix} 1 & \cdot & \cdot \\ \cdot & 1 & \cdot \\ \cdot & \cdot & 1 \end{bmatrix} = I. \qquad (6)$$

Similarly $MN = I$. Thus a matrix is commutative with its
reciprocal, and if we define $M^{-1}$ to mean the reciprocal of $M$
we have the result

$$MM^{-1} = M^{-1}M = I. \qquad (7)$$

This allows us to use the notation $M^{-r}$ to mean indifferently the
$r$th power of the reciprocal of $M$ or the reciprocal of the $r$th power
of $M$. Also, provided $|M| \ne 0$, we have $M^0 = I$.

As examples of this index law the reader should prove the
following results, noticing the curious feature which emerges,
one of great importance in all non-commutative algebra, that
the inverse or reciprocal of a product of factors reverses the order
of the factors.

$$(ABC \ldots K)^{-1} = K^{-1} \ldots B^{-1}A^{-1}. \qquad (8)$$

Further, the same reversal is true of the operation of transposing a
matrix, which is denoted by an accent.

$$(AB)' = B'A', \ \text{\&c.} \qquad (9)$$

Lastly, the operations of inversion and transposition are commutative:

$$(A')^{-1} = (A^{-1})'. \qquad (10)$$
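
Results (8), (9) and (10) can be checked exactly for two-rowed matrices with rational arithmetic. The following Python sketch uses the standard library `fractions.Fraction`; the sample matrices and the helper names `inverse2`, `matmul` and `transpose` are assumptions made for illustration.

```python
from fractions import Fraction

def matmul(P, Q):
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
            for row in P]

def transpose(P):
    return [list(col) for col in zip(*P)]

def inverse2(P):
    """Reciprocal of a two-rowed matrix: co-factors divided by the determinant."""
    (a, b), (c, d) = P
    delta = a * d - b * c
    if delta == 0:
        raise ValueError("singular matrix has no reciprocal")
    return [[Fraction(d, delta), Fraction(-b, delta)],
            [Fraction(-c, delta), Fraction(a, delta)]]

A = [[1, 2], [3, 4]]
B = [[0, 1], [5, 6]]

reversal_of_inverse = inverse2(matmul(A, B)) == matmul(inverse2(B), inverse2(A))
same_order_fails = inverse2(matmul(A, B)) == matmul(inverse2(A), inverse2(B))
reversal_of_transpose = transpose(matmul(A, B)) == matmul(transpose(B), transpose(A))
inversion_commutes = inverse2(transpose(A)) == transpose(inverse2(A))
print(reversal_of_inverse, same_order_fails,
      reversal_of_transpose, inversion_commutes)
```

Only the comparison that keeps the factors in their original order fails, which is the curious feature the text points out.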

Singular Matrices. 


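When $|A|$ vanishes, the matrix $A$ is said to be singular.
It has no reciprocal, although there still exists the matrix of
elements $[\alpha_{ij}]$, where $\alpha_{ij}$ is the co-factor of $a_{ji}$ in $|A|$, which has
certain properties analogous to those of a reciprocal matrix.
This is called the adjugate matrix.

It is easy to adapt the result (6). Thus, for the matrix $M$ above,
the product of $M$ and its adjugate gives

$$\begin{bmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{bmatrix} \begin{bmatrix} A_1 & A_2 & A_3 \\ B_1 & B_2 & B_3 \\ C_1 & C_2 & C_3 \end{bmatrix} = \begin{bmatrix} |M| & \cdot & \cdot \\ \cdot & |M| & \cdot \\ \cdot & \cdot & |M| \end{bmatrix} = |M|\,I. \qquad (11)$$

This is also true for n rows and columns. In particular the product
of a singular matrix and its adjugate is the zero matrix.

Here is an example where the division law fails: neither
factor need necessarily be zero.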


9. Summary of Laws of Matrices. 

We conclude this chapter by summarizing our results, for
we now hold all the fundamental laws of matrix algebra. First
there are three fundamental operations,

Addition,

Multiplication,

Transposition,

this last being new, for it does not occur in ordinary algebra.
Denoting transposition by an accent, the laws which govern
matrices are

$$\begin{aligned} (A + B) + C &= A + (B + C), & (A \times B) \times C &= A \times (B \times C), \\ A \times (B + C) &= AB + AC, & (A + B) \times C &= AC + BC, \\ A + B &= B + A, \end{aligned}$$

$$(A + B)' = A' + B', \qquad (A')' = A, \qquad (AB)' = B'A',$$

$$(AB)^{-1} = B^{-1}A^{-1},$$

$$(A^{-1})' = (A')^{-1}.$$

The commutative law of multiplication fails in general, but
holds at any rate in certain cases, namely

AX and XA are equal when X is zero, the unit matrix, a power
of A, or a scalar matrix.

The scalar matrix

$$\begin{bmatrix} p & \cdot & \cdot & \ldots \\ \cdot & p & \cdot & \ldots \\ \cdot & \cdot & p & \ldots \\ \ldots & \ldots & \ldots & \ldots \end{bmatrix}$$

is a device for expressing an ordinary number $p$ as a matrix.
In this way, ordinary algebra can be thought of as a particular
case of matrix algebra, when all matrices involved are scalar
matrices all of the same order. Thus, for instance, $p^2 - q^2
= (p + q)(p - q)$ in matrix algebra would be $P^2 - Q^2 =
(P + Q)(P - Q)$ where $P = pI$, $Q = qI$. Also $pA = pIA =
A(pI) = Ap$, a result which incorporates and extends scalar
multiplication as defined on p. 34.
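
The contrast between scalar matrices and matrices in general may be seen on a small case. In this Python sketch the helper names (`scalar`, `add`, `sub`, `matmul`) and the numbers are illustrative assumptions; the factorization of a difference of squares is tested first for $P = 3I$, $Q = 2I$ and then for two matrices that do not commute.

```python
def matmul(P, Q):
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
            for row in P]

def add(P, Q):
    return [[x + y for x, y in zip(r, s)] for r, s in zip(P, Q)]

def sub(P, Q):
    return [[x - y for x, y in zip(r, s)] for r, s in zip(P, Q)]

def scalar(p, n):
    """The scalar matrix pI of order n."""
    return [[p if i == j else 0 for j in range(n)] for i in range(n)]

A = [[1, 2], [3, 4]]
B = [[0, 1], [5, 6]]
P, Q = scalar(3, 2), scalar(2, 2)

scalar_case_holds = (sub(matmul(P, P), matmul(Q, Q))
                     == matmul(add(P, Q), sub(P, Q)))
general_case_holds = (sub(matmul(A, A), matmul(B, B))
                      == matmul(add(A, B), sub(A, B)))
commutes_with_scalar = matmul(P, A) == matmul(A, P)
print(scalar_case_holds, general_case_holds, commutes_with_scalar)
```

The identity holds for the scalar matrices and fails for `A` and `B`, because expanding $(A + B)(A - B)$ leaves the terms $BA - AB$, which vanish only when the two matrices commute.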

In the following examples capital letters mean matrices of 
order n, small letters mean ordinary numbers. 


EXAMPLES

1. Why is $A^2 - B^2 \ne (A + B)(A - B)$ in general?

2. Prove $A^2 - I = (A + I)(A - I) = (A - I)(A + I)$.

3. $A^2 - (\lambda + \mu)A + \lambda\mu I = (A - \lambda I)(A - \mu I)$.

4. If $A, B$ are two-rowed matrices, and $B$ commutes with $A$ then
$B = \lambda A + \mu I$.

5. If $f(A) = p_0A^q + p_1A^{q-1} + \ldots + p_qI$, where
$p_0, \ldots, p_q$ are scalar and $q$ is a positive integer, prove $f(A)$ is a matrix of
order $n$ which commutes with $A$.

6. If $g(A)$ is another such polynomial, prove that $\{g(A)\}^{-1}$ is another
such matrix which conforms with $A$, provided the determinant $|g(A)|$
does not vanish.

7. Prove $f(A) \times (g(A))^{-1} = (g(A))^{-1} \times f(A)$. Why is the notation
$\dfrac{A}{B}$ ambiguous, but $\dfrac{f(A)}{g(A)}$ not so?

8. If $|B| \ne 0$, and $AB = BA$, prove that $AB^{-1} = B^{-1}A$.

9. If $A = \begin{bmatrix} \cdot & -\tan\dfrac{\alpha}{2} \\ \tan\dfrac{\alpha}{2} & \cdot \end{bmatrix}$, prove

$$\frac{I + A}{I - A} = \begin{bmatrix} \cos\alpha & -\sin\alpha \\ \sin\alpha & \cos\alpha \end{bmatrix}.$$

10. Any two rational functions $\phi(A), \theta(A)$ of a single matrix $A$ commute
with one another, and hence they only differ in behaviour from
scalar numbers in failing to obey the division law.

11. Prove $(I - AB)A(I + BA) = (I + AB)A(I - BA)$.

12. Prove the product

$$\begin{bmatrix} 0 & c & -b \\ -c & 0 & a \\ b & -a & 0 \end{bmatrix} \begin{bmatrix} a^2 & ab & ac \\ ab & b^2 & bc \\ ac & bc & c^2 \end{bmatrix}$$

is zero.

13. If $AB = 0$ it does not follow that $BA = 0$.

■ -4” ‘] [i: :]■ 

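14. If $X = [x_{ij}]$, $A = [a_{ij}]$, $B = [b_{ij}], \ldots$ denote square matrices of
order $n$, prove that $\Sigma\, x_{ii}$ denotes the sum of the elements in the leading
diagonal of $X$.

If $s_x$ denote this sum, prove

$$s_{ab} = \sum_i \sum_j a_{ij}b_{ji}, \qquad s_{abc} = \sum_i \sum_j \sum_k a_{ij}b_{jk}c_{ki},$$

each summation running $1, 2, \ldots, n$.

15. Prove $s_{abc} = s_{bca} = s_{cab}$.

16. The sum of the elements in the leading diagonal is the same for all
matrices $X, Y, \ldots$ where

$$X = ABC \ldots H, \quad Y = BC \ldots HA, \ \ldots$$

obtained by cyclic symmetry.

17. Prove $s_x = s_{x'}$.

18. Prove that, if $n = 2$,

$$\begin{bmatrix} \dfrac{\partial y}{\partial x_{11}} & \dfrac{\partial y}{\partial x_{21}} \\ \dfrac{\partial y}{\partial x_{12}} & \dfrac{\partial y}{\partial x_{22}} \end{bmatrix} = 2X, \quad \text{where } y = s_{xx}.$$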


19 


'. Denoting this matrix in general by 

J 




: when « 


prove 

.X 



CHAPTER V 


Linear Equations. The Theorem of Corresponding 
Matrices. Further Theorems 

1. Matrices and Linear Equations. Rank. 

We now replace the definition of rank given in §5, p. 10,
by a more practical statement.

Definition of Rank. A matrix of $m$ rows and $n$ columns has
rank $r$, when not all its minor determinants of order $r$ vanish, while
all of order $r + 1$ do so.

It follows at once by a Laplace development that not all
minors of order less than $r$ vanish, whereas all of order greater
than $r$ do so. Also $r \le n$, $r \le m$.

Again, since non-zero determinants exist for all values of $n$,
the unit determinant $|\delta_{ij}|$ being such an one, a matrix can have
rank equal to the smaller of $n$ or $m$.

When $r = n = m$ the matrix is non-singular, and for the
subsequent theory of invariants this is by far the most important
case. When $r < n$ or $r < m$ the matrix is singular.

It can now be shown that the new definition implies the earlier
property, that a matrix of rank $r$ has $r$ linearly independent rows
and also $r$ such columns, but not $r + 1$ such rows or $r + 1$ such
columns.

Proof. At least one set of $r$ columns exists between which a
relation

$$\mu_1\,\mathrm{col}_a + \mu_2\,\mathrm{col}_b + \ldots + \mu_r\,\mathrm{col}_g = 0 \qquad (1)$$

is impossible. For let the $r$ columns $a, b, \ldots, g$ belong to any one
$r$-rowed minor $\Delta_r = |a_ib_j \ldots g_h|$ which does not vanish. Then
the assumed relation (1) requires among others, the $r$ equations

$$\begin{aligned} \mu_1a_i + \mu_2b_i + \ldots + \mu_rg_i &= 0, \\ \ldots \qquad & \\ \mu_1a_h + \mu_2b_h + \ldots + \mu_rg_h &= 0. \end{aligned} \qquad (2)$$

Multiplying these respectively by the co-factors of $a_i, \ldots, a_h$ in
$\Delta_r$ and adding, we obtain

$$\mu_1\,|a_ib_j \ldots g_h| + \mu_2\,|b_ib_j \ldots g_h| + \ldots = 0, \qquad \mu_1\Delta_r = 0, \qquad (3)$$

whence $\mu_1 = 0$, since $\Delta_r \ne 0$. Similarly each $\mu$ vanishes and no
relation (1) exists. The corresponding case of rows can be treated
in the same way.

Now let $q_i, q_j, \dots, q_k$ be r + 1 arbitrary non-zero numbers,
and [ab . . . p] be any minor matrix of order r + 1 of a given
matrix M, formed by attaching row k and column p to those
represented in $\Delta_r$. Then, from the fundamental identity (IV)
(cf. §12, Ex. 3, 4, p. 51) we deduce the further identity

$$(qb \dots p)_{ij\dots k}\,a_i + (aq \dots p)_{ij\dots k}\,b_i + \dots + (ab \dots q)_{ij\dots k}\,p_i = (ab \dots p)_{ij\dots k}\,q_i. \qquad (4)$$

Since M has rank r, this minor determinant $(ab \dots p)_{ij\dots k}$ of order
r + 1 vanishes. Hence the left-hand sum also is identically zero.
Thereby the r + 1 elements $a_i, b_i, \dots, p_i$ of row i in M are linearly
related: namely

$$\lambda_1 a_i + \lambda_2 b_i + \dots + \lambda_{r+1}\,p_i = 0, \qquad (5)$$

where $\lambda_1 = (qb \dots p)_{ij\dots k}$, &c. Here $\lambda_{r+1}$ at least cannot vanish
identically because, in it, $\Delta_r$ is co-factor of $q_k$. Similarly the same
relation holds for rows j, . . . , k. Further it holds for any other
row l, since

$$\lambda_1 a_l + \lambda_2 b_l + \dots + \lambda_{r+1}\,p_l = (aqb \dots p)_{lij\dots k}, \qquad (6)$$

which last vanishes on expansion by its column q, since each
co-factor so formed is a determinant of order r + 1 belonging to
M. Combining these results, the r + 1 columns are linearly
related; namely

$$\lambda_1\,\mathrm{col}_a + \lambda_2\,\mathrm{col}_b + \dots + \lambda_{r+1}\,\mathrm{col}_p = 0. \qquad (7)$$

This expresses any column p in terms of r suitably chosen
columns. Similarly we prove any r + 1 rows to be linearly
related. This proves the theorem.

Corollary. — A matrix and its transposed have the same rank.
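The definition of rank by minors, and the corollary just stated, can be tried on a small case. The following is only a sketch: the helper names `det`, `rank_by_minors` and `transpose`, and the matrix `M`, are illustrative choices and are not taken from the text.

```python
from itertools import combinations


def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    return sum(
        (-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
        for j in range(len(m))
    )


def rank_by_minors(m):
    """Largest r for which some r-rowed minor of m does not vanish."""
    rows, cols = len(m), len(m[0])
    for r in range(min(rows, cols), 0, -1):
        for rs in combinations(range(rows), r):
            for cs in combinations(range(cols), r):
                if det([[m[i][j] for j in cs] for i in rs]) != 0:
                    return r
    return 0


def transpose(m):
    return [list(col) for col in zip(*m)]


# Hypothetical example: the second row is twice the first.
M = [[1, 2, 3, 4],
     [2, 4, 6, 8],
     [1, 0, 1, 0]]
print(rank_by_minors(M), rank_by_minors(transpose(M)))
```

Searching all minors is slow for large matrices; it is used here only because it mirrors the definition directly.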





2. Application to Linear Equations. 

A matrix M of n rows and m columns is closely connected
with the theory of n homogeneous equations linear in m variables,
or m such equations in n variables.

Thus if $a_x = a_1x_1 + a_2x_2 + \dots + a_nx_n$, with like abbreviations
$b_x, c_x, \dots$, the matrix

$$M = \begin{bmatrix} a_1 & a_2 & \dots & a_n \\ b_1 & b_2 & \dots & b_n \\ \vdots & & & \vdots \\ k_1 & k_2 & \dots & k_n \end{bmatrix}$$

is that of the m linear equations

$$a_x = 0, \quad b_x = 0, \quad \dots, \quad k_x = 0.$$

To say $\lambda\,\mathrm{row}_a + \mu\,\mathrm{row}_b + \dots = 0$ is in effect the same as to
say

$$\lambda a_x + \mu b_x + \dots = 0,$$

namely that this is an identity for all values of x. Hence the
rank r of M determines exactly how many of the equations are
effectively independent. So among $a_x, b_x, \dots$, exactly r independent
forms can be chosen. Let them be the first r: $a_x, b_x, \dots, g_x$.
Also among the columns exactly r are independent.
Let them be columns 1, 2, . . . , r. As we are dealing
with equations, there is no loss of generality in making these
assumptions.

If now $r = n \le m$, we have n equations $a_x = 0$, $b_x = 0$, . . . ,
$k_x = 0$ whose determinant $\Delta = (ab \dots k) \ne 0$. Multiplying
each equation by the co-factor in $\Delta$ of the coefficient of $x_i$ and
adding, we get $x_i\Delta = 0$. Hence $x_i$ vanishes for each value of i,
and only the zero solution $x_1 = x_2 = \dots = x_n = 0$ exists.

Next if r = n − 1, so that there must be n − 1 independent
rows, there is exactly one solution for the ratios $x_1 : x_2 : \dots : x_n$,
namely

$$x_1 : x_2 : \dots : x_n = K_1 : K_2 : \dots : K_n,$$

where these are the (n − 1)-rowed determinants in the n − 1
by n matrix of these n − 1 equations. This follows in the same
way by multiplying the n − 1 equations respectively by the
co-factors of a, b, . . . in the determinant $(ab \dots g)_{12 \dots n-1}$ and
adding.
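As a sketch of the case r = n − 1, the ratios can be computed by striking out one column at a time and attaching alternating signs to the resulting (n − 1)-rowed determinants. The function name `solution_ratios` and the two sample equations are assumptions made for illustration only.

```python
def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    return sum(
        (-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
        for j in range(len(m))
    )


def solution_ratios(m):
    """For an (n-1) x n matrix of rank n-1, return x_1..x_n up to a common factor.

    x_j is the signed (n-1)-rowed determinant left after deleting column j.
    """
    n = len(m[0])
    return [(-1) ** j * det([row[:j] + row[j + 1:] for row in m]) for j in range(n)]


# Hypothetical system: x1 + 2 x2 + 3 x3 = 0 and 4 x1 + 5 x2 + 6 x3 = 0.
equations = [[1, 2, 3],
             [4, 5, 6]]
x = solution_ratios(equations)
print(x)  # [-3, 6, -3], i.e. x1 : x2 : x3 = 1 : -2 : 1
residuals = [sum(c * v for c, v in zip(row, x)) for row in equations]
print(residuals)  # [0, 0]
```

For n = 3 this is the familiar cross product of the two coefficient rows.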

Further if r < n − 1, we can solve the r equations for just
$x_1, x_2, \dots, x_r$ in terms of the remaining n − r homogeneous
variables $x_{r+1}, \dots, x_n$, again by use of co-factors of elements
in a column of the non-vanishing determinant $(ab \dots g)_{12 \dots r}$.

EXAMPLES 

1. The rank of the system of equations

2x + 2y + 3z − t = 0
z − 3t = 0
3x + 3y + 5z − 3t = 0

is two. We could express two of x, y, z, t in terms of the others, but not
any two. We must exclude the case x, y.

2. Given two equations

$a_1x_1 + b_1x_2 + c_1x_3 + d_1x_4 = 0$
$a_2x_1 + b_2x_2 + c_2x_3 + d_2x_4 = 0$

the most general linear equation derivable is

$(ap)x_1 + (bp)x_2 + (cp)x_3 + (dp)x_4 = 0$

where $(ap) = a_1p_2 - a_2p_1$ and $p_1$, $p_2$ are arbitrary. We eliminate $x_1$ or $x_2$
or $x_3$ or $x_4$ by putting p = a, b, c, d in succession, provided not all of the
determinants (ab) vanish.

3. Given three equations

$a_ix_1 + b_ix_2 + c_ix_3 + d_ix_4 = 0$ (i = 1, 2, 3)

prove $(apq)x_1 + (bpq)x_2 + (cpq)x_3 + (dpq)x_4 = 0$ where $(apq) = \sum \pm a_1p_2q_3$,
and the six elements $p_i$, $q_j$ are arbitrary. By putting p, q equal to two of
a, b, c, d we eliminate two of the x's from the equations, provided not all
determinants (abc) vanish.

4. Prove similarly that r − 1 unknowns $x_i$ can be eliminated from r
such equations whose matrix is of rank r.

5. Prove the fundamental identity

$(abcq)p_x - (abcp)q_x = (bcpq)a_x + (capq)b_x + (abpq)c_x,$

where $p_x = p_1x_1 + p_2x_2 + p_3x_3 + p_4x_4$. If the matrix [abcd] has rank two,
prove (abcp) = 0, (abcq) = 0.

This fundamental identity now explicitly indicates the linear relation
between three equations $a_x = 0$, $b_x = 0$, $c_x = 0$, whose rank is two.
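Examples 1 and 5 lend themselves to a quick numerical check. The sketch below assumes that the system of Example 1 reads 2x + 2y + 3z − t = 0, z − 3t = 0, 3x + 3y + 5z − 3t = 0 (the print is unclear), and the five vectors used for Example 5 are arbitrary illustrative values. The helper names are hypothetical.

```python
from itertools import combinations


def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    return sum(
        (-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
        for j in range(len(m))
    )


def has_nonzero_minor(m, cols):
    """True if some minor of order len(cols) taken from the given columns is non-zero."""
    r = len(cols)
    return any(det([[m[i][j] for j in cols] for i in rs]) != 0
               for rs in combinations(range(len(m)), r))


# Example 1 (assumed reading of the coefficients).
M = [[2, 2, 3, -1],
     [0, 0, 1, -3],
     [3, 3, 5, -3]]
names = "xyzt"
three_rowed = [has_nonzero_minor(M, cs) for cs in combinations(range(4), 3)]
solvable_pairs = [names[i] + names[j]
                  for i, j in combinations(range(4), 2)
                  if has_nonzero_minor(M, (i, j))]
print(three_rowed)     # all False, so the rank is below three
print(solvable_pairs)  # every pair except "xy"


def d4(*vectors):
    """Four-rowed determinant of four vectors."""
    return det([list(v) for v in vectors])


# Example 5, checked coefficient by coefficient for arbitrary vectors.
a, b, c, p, q = (1, 0, 2, 1), (0, 1, 1, 3), (2, 1, 0, 1), (1, 1, 1, 0), (3, 0, 1, 2)
lhs = [d4(a, b, c, q) * p[i] - d4(a, b, c, p) * q[i] for i in range(4)]
rhs = [d4(b, c, p, q) * a[i] + d4(c, a, p, q) * b[i] + d4(a, b, p, q) * c[i]
       for i in range(4)]
print(lhs == rhs)
```

The pair x, y is excluded because the x and y columns are identical, so every two-rowed determinant drawn from them vanishes.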




3. The Upper Suffix Notation. 

Let us denote the elements of the reciprocal determinant
$\Delta^{-1}$ by $a^1, b^1, \dots, m^n$, in defiance of previous practice which
reserves this notation for powers of a. The context will make
it clear what is meant. So at present we shall understand the
indices 1, 2, 3, . . . , n to be distinguishing marks like the accents
which are probably more familiar

a, a', a'', . . . .

By this means we can exhibit a wonderful parallelism running
through the whole theory of determinants, starting with the
obvious pair of conditions

$$\Delta = |a_1b_2 \dots m_n|, \qquad \Delta^{-1} = |a^1b^2 \dots m^n|.$$

Since $A_1/\Delta$ is by definition $a^1$, where $A_i$ is the co-factor of $a_i$ in $\Delta$, and since

$$\Delta = a_1A_1 + a_2A_2 + \dots + a_nA_n,$$

we have

$$1 = a_1a^1 + a_2a^2 + \dots + a_na^n.$$

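If $A^1$ is the co-factor of $a^1$ in $\Delta^{-1}$ we find $A^1 \times \Delta = a_1$. This
is soon proved. For, multiplying rows by rows, $A^1 \times \Delta$

$$= \begin{vmatrix} 1 & . & . & . & \dots \\ a^2 & b^2 & c^2 & d^2 & \dots \\ a^3 & b^3 & c^3 & d^3 & \dots \\ a^4 & b^4 & c^4 & d^4 & \dots \\ \dots & & & & \end{vmatrix} \times \begin{vmatrix} a_1 & b_1 & c_1 & d_1 & \dots \\ a_2 & b_2 & c_2 & d_2 & \dots \\ a_3 & b_3 & c_3 & d_3 & \dots \\ a_4 & b_4 & c_4 & d_4 & \dots \\ \dots & & & & \end{vmatrix} = \begin{vmatrix} a_1 & a_2 & a_3 & a_4 & \dots \\ . & 1 & . & . & \dots \\ . & . & 1 & . & \dots \\ . & . & . & 1 & \dots \\ \dots & & & & \end{vmatrix} = a_1,$$

and similarly for $a_2$.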

This leads to a general theorem, due to Jacobi:¹

Each minor of $\Delta$ is proportional to the corresponding complementary
co-factor of $\Delta^{-1}$, the ratio being $\Delta$.

¹ Cf. Crelle, 12 (1834), 9.

The full significance of this remarkable theorem is best
seen by taking a particular case, when $\Delta = |a_1b_2c_3d_4|$, and

$$\frac{a_1}{|b^2c^3d^4|} = \dots = \frac{|a_1b_2|}{|c^3d^4|} = \dots = \frac{|a_1b_2c_3|}{d^4} = \frac{|a_1b_2c_3d_4|}{1} = \frac{1}{|a^1b^2c^3d^4|}.$$

There are altogether 70 different equal ratios involved here,
while each ratio involves all the letters and all the digits 1, 2, 3, 4.
In general we have

$$\frac{|a_\alpha b_\beta \dots f_\zeta|}{|g^\eta h^\theta \dots m^\nu|} = \frac{1}{|a^1b^2 \dots m^n|},$$

where, to fix the sign, both letter and suffix rows are algebraic
complements

ab . . . f, gh . . . m,

and the partition is taken in every possible way.

The proof is immediate for any particular case, by the multiplication
theorem. Thus $|c^3d^4| \times |a_1b_2c_3d_4| = |a_1b_2|$, since

$$\begin{vmatrix} 1 & . & . & . \\ . & 1 & . & . \\ a^3 & b^3 & c^3 & d^3 \\ a^4 & b^4 & c^4 & d^4 \end{vmatrix} \times \begin{vmatrix} a_1 & a_2 & a_3 & a_4 \\ b_1 & b_2 & b_3 & b_4 \\ c_1 & c_2 & c_3 & c_4 \\ d_1 & d_2 & d_3 & d_4 \end{vmatrix} = \begin{vmatrix} a_1 & a_2 & a_3 & a_4 \\ b_1 & b_2 & b_3 & b_4 \\ . & . & 1 & . \\ . & . & . & 1 \end{vmatrix} = |a_1b_2|.$$

The only difficulty in this proof is to decide how to make
the minor look like an n-rowed determinant. An easy way to
remember what to do is to notice how two complementary blocks
of the unit determinant are utilized in the course of the work.


To prove | | | 1 == 1 ^2^3 I the order c, a, 6, d 

of the letters has been disturbed, write | ^^62^3^4 1 1 ^1^2 ^3 ^4 I 

and proceed as before. 

It is important to recognize this theorem in the form of the
adjugate determinant and its minors. Thus if capital letters denote
co-factors of small letters in $\Delta$, the following would be typical
when $\Delta = |a_1b_2c_3d_4|$:

$$|a_1b_2|\,\Delta = |C_3D_4|, \qquad \Delta^3 = |A_1B_2C_3D_4|,$$

and so on.
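The two adjugate relations just quoted are polynomial identities, so they can be checked on any integer matrix. The sketch below is illustrative only: the matrix `M` and the helper `cofactor` are assumed for the purpose, with row i of `M` holding $a_i, b_i, c_i, d_i$.

```python
def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    return sum(
        (-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
        for j in range(len(m))
    )


def cofactor(m, i, j):
    """Signed minor obtained by deleting row i and column j."""
    minor = [row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i]
    return (-1) ** (i + j) * det(minor)


# Hypothetical matrix: suffix = row, letter = column.
M = [[2, 1, 0, 3],
     [1, 3, 1, 0],
     [0, 2, 1, 1],
     [1, 0, 2, 1]]
delta = det(M)
C = [[cofactor(M, i, j) for j in range(4)] for i in range(4)]  # the capital letters

minor_a1b2 = M[0][0] * M[1][1] - M[0][1] * M[1][0]  # |a_1 b_2|
minor_C3D4 = C[2][2] * C[3][3] - C[2][3] * C[3][2]  # |C_3 D_4|
print(minor_a1b2 * delta == minor_C3D4)             # |a_1 b_2| * Delta = |C_3 D_4|
print(det(C) == delta ** 3)                         # Delta^3 = |A_1 B_2 C_3 D_4|
```

Working with co-factors rather than with the reciprocal elements avoids division, so integer arithmetic is exact.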


4. The Theorem of Corresponding Matrices. 

Next consider two arbitrary square matrices A and X of
order n,

$$A = \begin{bmatrix} a_1 & \dots & m_1 \\ \vdots & & \vdots \\ a_n & \dots & m_n \end{bmatrix}, \qquad X = \begin{bmatrix} x_1 & \dots & t_1 \\ \vdots & & \vdots \\ x_n & \dots & t_n \end{bmatrix},$$

together with their transposed matrices A' and X'. It is assumed
that the determinant $|A| \ne 0$.

By the same device of interchanging corresponding rectangular
subsections of these we obtain another important
theorem, which is illustrated well enough again by taking
n = 4.

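First replace the top row of A' by that of X', raise the suffixes
of other rows and multiply the determinant so found by |A|.
Then

$$\begin{vmatrix} x_1 & x_2 & x_3 & x_4 \\ b^1 & b^2 & b^3 & b^4 \\ c^1 & c^2 & c^3 & c^4 \\ d^1 & d^2 & d^3 & d^4 \end{vmatrix} \times \begin{vmatrix} a_1 & b_1 & c_1 & d_1 \\ a_2 & b_2 & c_2 & d_2 \\ a_3 & b_3 & c_3 & d_3 \\ a_4 & b_4 & c_4 & d_4 \end{vmatrix} = \begin{vmatrix} a_x & b_x & c_x & d_x \\ . & 1 & . & . \\ . & . & 1 & . \\ . & . & . & 1 \end{vmatrix} = a_x, \qquad (8)$$

which is obvious otherwise by expanding the first determinant
by its top row and using the previous theorem. In fact

$$(x_1\,|b^2c^3d^4| - x_2\,|b^1c^3d^4| + \dots)\,|a_1b_2c_3d_4| = x_1a_1 + x_2a_2 + x_3a_3 + x_4a_4.$$

Next do the same for the two top rows of A' and X'. This gives

$$\begin{vmatrix} x_1 & x_2 & x_3 & x_4 \\ y_1 & y_2 & y_3 & y_4 \\ c^1 & c^2 & c^3 & c^4 \\ d^1 & d^2 & d^3 & d^4 \end{vmatrix} \times \begin{vmatrix} a_1 & b_1 & c_1 & d_1 \\ a_2 & b_2 & c_2 & d_2 \\ a_3 & b_3 & c_3 & d_3 \\ a_4 & b_4 & c_4 & d_4 \end{vmatrix} = \begin{vmatrix} a_x & b_x & c_x & d_x \\ a_y & b_y & c_y & d_y \\ . & . & 1 & . \\ . & . & . & 1 \end{vmatrix} = a_xb_y - a_yb_x. \qquad (9)$$

This last is so important a form that it has a special notation

$$a_xb_y - a_yb_x = (ab \mid xy). \qquad (10)$$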

Again expand the first determinant in (9), this time by its two
top rows, and use the previous theorem. It gives

$$a_xb_y - a_yb_x = \sum (xy)_{12}(ab)_{12}$$

summed to six terms (= 4!/2! 2!). Thus

$$a_xb_y - a_yb_x = (ab \mid xy) = \sum (xy)_{ij}(ab)_{ij}, \qquad (11)$$

and as this proof equally well applies when A is of order n, and
there are n − 2 rows below the top two, we may sum this last
for

$$i, j = 1, 2, 3, \dots, n, \qquad i < j.$$


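Similarly for three top rows

$$\begin{vmatrix} x_1 & x_2 & x_3 & x_4 \\ y_1 & y_2 & y_3 & y_4 \\ z_1 & z_2 & z_3 & z_4 \\ d^1 & d^2 & d^3 & d^4 \end{vmatrix} \times \begin{vmatrix} a_1 & b_1 & c_1 & d_1 \\ a_2 & b_2 & c_2 & d_2 \\ a_3 & b_3 & c_3 & d_3 \\ a_4 & b_4 & c_4 & d_4 \end{vmatrix} = \begin{vmatrix} a_x & b_x & c_x & d_x \\ a_y & b_y & c_y & d_y \\ a_z & b_z & c_z & d_z \\ . & . & . & 1 \end{vmatrix} \qquad (12)$$

$$= \sum \pm a_xb_yc_z = (abc \mid xyz), \text{ say.}$$

Also on expanding the first determinant by its three top rows,
it gives

$$((xyz)_{123}\,d^4 + \dots)\,(abcd) = \sum (xyz)_{123}(abc)_{123}$$

to four terms (= 4!/3! 1!). Thus

$$\sum \pm a_xb_yc_z = (abc \mid xyz) = \sum (abc)_{ijk}(xyz)_{ijk}, \qquad i < j < k. \qquad (13)$$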


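Lastly, if we remove all rows of A' we revert to the ordinary
product theorem of determinants:

$$\begin{vmatrix} x_1 & x_2 & x_3 & x_4 \\ y_1 & y_2 & y_3 & y_4 \\ z_1 & z_2 & z_3 & z_4 \\ t_1 & t_2 & t_3 & t_4 \end{vmatrix} \times \begin{vmatrix} a_1 & b_1 & c_1 & d_1 \\ a_2 & b_2 & c_2 & d_2 \\ a_3 & b_3 & c_3 & d_3 \\ a_4 & b_4 & c_4 & d_4 \end{vmatrix} = \begin{vmatrix} a_x & b_x & c_x & d_x \\ a_y & b_y & c_y & d_y \\ a_z & b_z & c_z & d_z \\ a_t & b_t & c_t & d_t \end{vmatrix} = (abcd \mid xyzt), \text{ say.}$$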

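We collect these results, which will turn out to be very
significant, and now have the following n identities involving
elements of two square matrices of order n:

$$a_x = a_1x_1 + a_2x_2 + \dots + a_nx_n, \qquad (15_1)$$

$$\begin{vmatrix} a_x & b_x \\ a_y & b_y \end{vmatrix} = (ab \mid xy) = \sum (ab)_{ij}(xy)_{ij}, \qquad (15_2)$$

$$\begin{vmatrix} a_x & b_x & c_x \\ a_y & b_y & c_y \\ a_z & b_z & c_z \end{vmatrix} = (abc \mid xyz) = \sum (abc)_{ijk}(xyz)_{ijk}, \qquad (15_3)$$

$$\begin{vmatrix} a_x & b_x & c_x & \dots & m_x \\ a_y & b_y & c_y & \dots & m_y \\ \dots & & & & \\ a_t & b_t & c_t & \dots & m_t \end{vmatrix} = (abc \dots m \mid xyz \dots t) = (abc \dots m)(xyz \dots t), \qquad (15_n)$$

$$i, j, k, \dots = 1, 2, 3, \dots, n.$$

Here there are $\binom{n}{r}$ terms on the right of the rth identity, obtained
by choosing the r suffixes in all different combinations from among
1, 2, 3, . . . , n.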

Determinants of this type on the left, but of higher order
than n, vanish identically. For by §12, (V), p. 50, the elements of
the (n + 1)th column are each the same linear function of the
first n columns. Hence the (n + 1)th column is linearly related
to these columns.

At the outset we have assumed that $|A| \ne 0$, but the above
results hold for all values of the elements $a_i, b_j, \dots, x_i, \dots$
concerned. For if in the rth identity the r columns a, b, . . . , k
are linearly related, then

$$\lambda a_i + \mu b_i + \dots + \kappa k_i = 0, \qquad i = 1, 2, \dots, n.$$

Multiplying by $x_i$ and summing for i we have

$$\lambda a_x + \mu b_x + \dots + \kappa k_x = 0,$$

so that $a_x, b_x, \dots, k_x$ are also linearly related. Consequently
both sides of relation $(15_r)$ vanish identically.

But if a, b, . . . , k are not linearly related, at least one
r-rowed determinant of the matrix [a, b, . . . , k] does not vanish,
$|A_r|$ say. Let [l . . . m] denote the (n − r)-columned matrix,
formal dual to [a, b, . . . , k] in A = [ab . . . m]. By choosing
[l . . . m] to be zero except for its diagonal elements complementary
to $|A_r|$, which we take as units, we make

$$|A| = |A_r| \ne 0.$$

Thus the original assumption is covered.

These results may be put into a slightly different form, of
great importance in the theory of matrices.

If A is any matrix of n columns, and B is any matrix of n rows,
any r-rowed determinant D of the product matrix AB is equal to a
sum of terms each a product of an r-rowed determinant of A and an
r-rowed determinant of B.

For this is a statement of formula $(15_r)$ when

$$A = \begin{bmatrix} a_1 & a_2 & \dots & a_n \\ b_1 & b_2 & \dots & b_n \\ \dots & & & \end{bmatrix}, \qquad B = \begin{bmatrix} x_1 & y_1 & \dots \\ x_2 & y_2 & \dots \\ \vdots & & \\ x_n & y_n & \dots \end{bmatrix}.$$

Here A and B need not be square matrices.
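This statement (nowadays often called the Cauchy-Binet formula) can be illustrated on a 2 by 4 and a 4 by 2 matrix. The sketch below is only an illustration; the matrices `A`, `B` and the helper `matmul` are assumed values and names, not part of the text.

```python
from itertools import combinations


def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    return sum(
        (-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
        for j in range(len(m))
    )


def matmul(a, b):
    """Ordinary row-by-column product of two conformable matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


# Hypothetical rectangular matrices: A has n = 4 columns, B has n = 4 rows.
A = [[1, 2, 0, 3],
     [4, 1, 5, 2]]
B = [[2, 1],
     [0, 3],
     [1, 1],
     [5, 2]]

lhs = det(matmul(A, B))
# Sum over the six ways of choosing two suffixes i < j out of 1..4.
rhs = sum(
    det([[A[r][k] for k in S] for r in range(2)]) * det([B[k] for k in S])
    for S in combinations(range(4), 2)
)
print(lhs == rhs)
```

Each term pairs the two-rowed determinant of A on columns i, j with the two-rowed determinant of B on rows i, j, exactly as in $(15_2)$.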

5. Inner Product of Two Rectangular Matrices. 

The determinants (a | x), (ab | xy), (abc | xyz), . . . elaborated in
the last article have many useful properties. It will be seen
by interchanging any pair among abc . . . or among xyz . . .
that the sign of the whole is changed; for this amounts to an
interchange of rows in the second or first factor of the original
product. Thus

(abc | xyz) = − (bac | xyz) = − (abc | yxz) = . . . .

Such a property is summed up by saying that (abc | xyz) alternates
in both abc and xyz. Further, it is symmetrical in these two
groups, thus (abc | xyz) = (xyz | abc).

Once more, it has exactly the same suffixes ijk for abc and for
xyz in any particular term of its expansion

$$\sum (abc)_{ijk}(xyz)_{ijk}.$$

For this reason it is an inner product of two sets of quantities,
namely the set of determinants of the three-line matrix

$$\begin{bmatrix} a_1 & a_2 & a_3 & a_4 & \dots \\ b_1 & b_2 & b_3 & b_4 & \dots \\ c_1 & c_2 & c_3 & c_4 & \dots \end{bmatrix}$$

and the corresponding set of

$$\begin{bmatrix} x_1 & x_2 & x_3 & x_4 & \dots \\ y_1 & y_2 & y_3 & y_4 & \dots \\ z_1 & z_2 & z_3 & z_4 & \dots \end{bmatrix}.$$

Likewise for each of these functions (a | x), (ab | xy), . . . . The
typical one can be called rth compound inner product, or the
inner product of two r by n matrices, r giving the number of
different symbols before or after the vertical line. Further, these
matrices are subdivisions by rows of the transposed matrices A', X' of
the square arrays from which we started.


6. Laplace Developments of the Inner Products. 

First we may write any such expression as

$$(abc \mid xyz) = \sum \pm a_xb_yc_z$$

and in the notation of p. 27 this can be written

$$(abc \mid xyz) = \dot a_x\dot b_y\dot c_z = a_{\dot x}b_{\dot y}c_{\dot z},$$

where either a, b, c are deranged, or xyz. By a (2, 1) Laplace
development this is also

$$(ab \mid yz)\,c_x + (ab \mid zx)\,c_y + (ab \mid xy)\,c_z,$$

and this principle may be extended to any order involving r
pairs of letters a, x; b, y; &c., if r < n.
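The (2, 1) development, together with the alternating and symmetric properties of the preceding article, can be checked on arbitrary vectors. This is a sketch under assumed names: `dot` forms $a_x$, and `inner` forms the compound inner product as the determinant whose rows answer to x, y, z and whose columns answer to a, b, c. The six vectors are illustrative values.

```python
def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    return sum(
        (-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
        for j in range(len(m))
    )


def dot(u, v):
    """Simple inner product a_x = a_1 x_1 + ... + a_n x_n."""
    return sum(p * q for p, q in zip(u, v))


def inner(letters, variables):
    """Compound inner product (ab... | xy...): determinant with entries a_x, b_x, ..."""
    return det([[dot(u, x) for u in letters] for x in variables])


# Hypothetical vectors with n = 4.
a, b, c = (1, 2, 0, 1), (0, 1, 3, 1), (2, 0, 1, 1)
x, y, z = (1, 1, 0, 2), (0, 2, 1, 1), (3, 0, 1, 1)

full = inner([a, b, c], [x, y, z])
laplace = (inner([a, b], [y, z]) * dot(c, x)
           + inner([a, b], [z, x]) * dot(c, y)
           + inner([a, b], [x, y]) * dot(c, z))
print(full == laplace)                           # (2, 1) Laplace development
print(inner([b, a, c], [x, y, z]) == -full)      # alternates in abc
print(inner([x, y, z], [a, b, c]) == full)       # symmetrical in the two groups
```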





7. Rank of the Product of Matrices.

The rank of the product of two matrices cannot exceed the rank
of either factor.

For in the result of §4, p. 81, if all r-rowed determinants of
A (or of B) are zero, the same is true of all r-rowed determinants
of their product AB. Again

The rank of a matrix A of m rows and n columns is unaltered
by multiplying A fore or aft by a conformable non-singular square
matrix.

For if r is the rank of A, and B is a square non-singular matrix
of order n, the rank $\rho$ of AB has just been proved to be not
greater than r. Likewise the rank of $(AB)B^{-1}$ cannot exceed $\rho$,
that of its first factor. But $(AB)B^{-1} = A$, so that r cannot exceed $\rho$.
Hence $\rho = r$. Similarly for a product with A as after factor.
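Both statements of this article can be seen at work on small matrices. The sketch below reuses a rank-by-minors helper; all names and matrices are assumptions chosen for illustration.

```python
from itertools import combinations


def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    return sum(
        (-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
        for j in range(len(m))
    )


def rank_by_minors(m):
    """Largest r for which some r-rowed minor of m does not vanish."""
    rows, cols = len(m), len(m[0])
    for r in range(min(rows, cols), 0, -1):
        for rs in combinations(range(rows), r):
            for cs in combinations(range(cols), r):
                if det([[m[i][j] for j in cs] for i in rs]) != 0:
                    return r
    return 0


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


A = [[1, 2, 3],
     [2, 4, 6]]                 # rank 1
B = [[1, 1, 0],
     [0, 1, 1],
     [0, 0, 1]]                 # non-singular, determinant 1
print(rank_by_minors(A), rank_by_minors(matmul(A, B)))    # rank unaltered

A2 = [[1, 0, 0],
      [0, 1, 0]]                # rank 2
S = [[1, 0, 0],
     [0, 0, 0],
     [0, 0, 0]]                 # singular, rank 1
print(rank_by_minors(matmul(A2, S)))   # cannot exceed the rank of either factor
```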

8. The Simplex. 

We now consider a theorem analogous to Jacobi's theorem,
which may first be illustrated by the case of four rows and
columns. Suppose we have, as before, two double sets, say

$$\begin{matrix} a_1 & a_2 & a_3 & a_4 \\ b_1 & b_2 & b_3 & b_4 \end{matrix} \qquad\qquad \begin{matrix} x_1 & x_2 & x_3 & x_4 \\ y_1 & y_2 & y_3 & y_4 \end{matrix}$$

so chosen that the matrix

$$\begin{bmatrix} a_x & b_x \\ a_y & b_y \end{bmatrix}$$

vanishes identically: thus $a_x = 0$, $b_x = 0$, or, in full,

$$a_1x_1 + a_2x_2 + a_3x_3 + a_4x_4 = 0,$$
$$b_1x_1 + b_2x_2 + b_3x_3 + b_4x_4 = 0,$$

and similarly for y. Eliminating $x_1$ we obtain

$$(ab)_{12}\,x_2 + (ab)_{13}\,x_3 + (ab)_{14}\,x_4 = 0.$$

Similarly

$$(ab)_{12}\,y_2 + (ab)_{13}\,y_3 + (ab)_{14}\,y_4 = 0.$$

These are two equations for the three quantities $(ab)_{12}$, $(ab)_{13}$,
$(ab)_{14}$. Hence, by solving,

$$(ab)_{12} : (ab)_{13} : (ab)_{14} = (xy)_{34} : (xy)_{42} : (xy)_{23},$$

and for similar reasons, by eliminating $x_2$ or $x_3$ or $x_4$ originally,
we obtain

$$\frac{(ab)_{12}}{(xy)_{34}} = \frac{(ab)_{13}}{(xy)_{42}} = \frac{(ab)_{14}}{(xy)_{23}} = \frac{(ab)_{23}}{(xy)_{14}} = \frac{(ab)_{42}}{(xy)_{13}} = \frac{(ab)_{34}}{(xy)_{12}}.$$
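A numerical instance may make these six equal ratios concrete. In the sketch below the four vectors are assumed values, chosen so that $a_x = a_y = b_x = b_y = 0$; the helper `two_rowed` and the two suffix orderings are illustrative, the pairing of suffixes being taken as 12 with 34, 13 with 42, 14 with 23, 23 with 14, 42 with 13, and 34 with 12.

```python
def two_rowed(u, v, i, j):
    """The determinant (uv)_ij = u_i v_j - u_j v_i, with suffixes counted from 1."""
    return u[i - 1] * v[j - 1] - u[j - 1] * v[i - 1]


def dot(u, v):
    return sum(p * q for p, q in zip(u, v))


# Hypothetical sets with vanishing inner products between {a, b} and {x, y}.
a, b = (1, 2, 0, 1), (0, 1, 1, 3)
x, y = (-2, 1, -1, 0), (-1, 0, -3, 1)
inner_products = [dot(a, x), dot(a, y), dot(b, x), dot(b, y)]

ab_suffixes = [(1, 2), (1, 3), (1, 4), (2, 3), (4, 2), (3, 4)]
xy_suffixes = [(3, 4), (4, 2), (2, 3), (1, 4), (1, 3), (1, 2)]  # complementary suffixes
p = [two_rowed(a, b, i, j) for i, j in ab_suffixes]
q = [two_rowed(x, y, i, j) for i, j in xy_suffixes]
print(inner_products)  # [0, 0, 0, 0]
print(p)               # [1, 1, 3, 2, -5, -1]
print(q)               # [-1, -1, -3, -2, 5, 1]

# Proportionality tested by cross-multiplication, so zero entries cause no trouble.
proportional = all(p[i] * q[j] == p[j] * q[i] for i in range(6) for j in range(6))
print(proportional)
```

Here the common ratio is −1; only the proportionality, not the particular factor, is asserted by the theorem.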


These formulae, which are fundamental in the study of line
geometry in threefold space, are typical of a general set involving
complementary rectangles p and $\pi$, say, from two square matrices
of order n. As an illustration let n = 5, and let the matrix

$$\begin{bmatrix} a_x & a_y & a_z \\ b_x & b_y & b_z \end{bmatrix}$$

vanish identically. Here five sets a, b, x, y, z are involved, and

$$a_x = a_1x_1 + a_2x_2 + a_3x_3 + a_4x_4 + a_5x_5 = 0.$$

As before we obtain, by eliminating $x_1$, $y_1$, $z_1$ from the three
pairs of equations,

$$(ab)_{12}\,x_2 + (ab)_{13}\,x_3 + (ab)_{14}\,x_4 + (ab)_{15}\,x_5 = 0,$$
$$(ab)_{12}\,y_2 + (ab)_{13}\,y_3 + (ab)_{14}\,y_4 + (ab)_{15}\,y_5 = 0,$$
$$(ab)_{12}\,z_2 + (ab)_{13}\,z_3 + (ab)_{14}\,z_4 + (ab)_{15}\,z_5 = 0,$$

which are just enough equations to determine the ratios
$(ab)_{12} : (ab)_{13} : \dots$ in terms of x, y, z. Multiplying these equations
respectively by $(yz)_{45}$, $(zx)_{45}$, $(xy)_{45}$ and adding, we deduce

$$(ab)_{12}\,(xyz)_{245} + (ab)_{13}\,(xyz)_{345} = 0,$$

or

$$(ab)_{12} : (ab)_{13} = (xyz)_{345} : (xyz)_{425},$$

and by symmetry

$$\frac{(ab)_{12}}{(xyz)_{345}} = \frac{(ab)_{13}}{(xyz)_{425}} = \dots = \frac{(ab)_{ij}}{(xyz)_{klm}},$$

where ij, klm are algebraic complements of 12345.

In general let there be r sets a, b, . . . , k and n − r sets
x, y, . . . , t such that the inner product of any two sets chosen
from these different groups vanishes. Then the determinants of
these two matrices are proportional, namely

$$\frac{(ab \dots k)_{\alpha\beta\dots\delta}}{(xy \dots t)_{\lambda\mu\dots\rho\sigma}} = \frac{(ab \dots k)_{\alpha'\beta'\dots\delta'}}{(xy \dots t)_{\lambda'\mu'\dots\rho'\sigma'}},$$

where $\alpha\beta\dots\delta$, $\lambda\mu\dots\rho\sigma$ are algebraic complements of 123 . . . n,
and so are the accented suffixes.

Lastly consider the double set arranged in two rows each of
n letters,

a b c . . . l m
x y z . . . s t

where each letter denotes a vector, say

$$a = \{a_1, a_2, \dots, a_n\}, \qquad x = \{x_1, x_2, \dots, x_n\},$$

and the inner product of any two vectors not in the same row
or column vanishes. When none of the inner products $a_x$, $b_y$,
. . . , $m_t$ vanish, such a system of vectors is called a simplex
of the nth category: the vectors of one row, it matters not which,
are called points, and those of the other row primes. Further,
let $p_2, p_3, \dots, p_{n-1}$ denote the sets of determinants $(ab)_{ij}$,
$(abc)_{ijk}$, . . . respectively, and $\pi_2, \pi_3, \dots$ the sets $(xy)_{ij}$,
$(xyz)_{ijk}$, . . . . Then the preceding results can be written in the
form


: = 7ri 


II 


Pn-^l 

7^2 


= k'(c.. 


■Pn-i 



- V'{d. 


Vn--i 

^n-1 


..p = 


Pj = m 


where the suffixes are algebraic complements of 123 . . .n, and 
the coefficients k are constant for all such suffixes. 


EXAMPLES 

1. If $\Delta = (abc \dots m)$ and $D = (xyz \dots t)$ denote the determinants of
the above double set of vectors, prove $\Delta D = a_xb_yc_z \dots m_t$.

2. Prove that each column of $\Delta$ is in fact a multiple of the corresponding
column of co-factors of D, and vice versa.

3. Exhibit the above relations when n = 3, and interpret them when
x, y, z are three vertices of a triangle whose sides are a, b, c.

4. If n = 4, show that the simplex is a tetrahedron of four vertices
x, y, z, t, four planes a, b, c, d, and six lines.

Here $b_x = 0$ is the homogeneous equation of a plane in point co-ordinates
$(x_1, x_2, x_3, x_4)$, or of a point in plane co-ordinates $[b_1, b_2, b_3, b_4]$. The conditions
imply that points y, z, t but not x are on plane a; and so on cyclically. The
set

$$p = [(ab)_{12}, (ab)_{13}, (ab)_{14}, (ab)_{23}, (ab)_{42}, (ab)_{34}]$$

is proportional to the set

$$\pi = [(xy)_{34}, (xy)_{42}, (xy)_{23}, (xy)_{14}, (xy)_{13}, (xy)_{12}]$$

as in §8 above. The set p is called the set of axial co-ordinates of the
line common to planes a, b: the set $\pi$ forms the set of line co-ordinates.
These p and $\pi$ sets are often taken to be identically equal since the introduction
of a non-zero common factor to homogeneous co-ordinates does
not alter the actual point, line or plane represented.


9. Extended Form of Cauchy’s Theorem, commonly called 
Sylvester’s Theorem on Compound Determinants. 

The theorem that $|A_1B_2 \dots M_n| = |a_1b_2 \dots m_n|^{n-1}$ is a special case of a
remarkable general theorem virtually due to Cauchy.¹ Let $|(ab)_{ij}|$ denote
the determinant of order $\binom{n}{2}$, the $\binom{n}{2}$ combinations like ab to typify columns
and the $\binom{n}{2}$ combinations ij to typify rows. Let us call this the second
compound of $\Delta$, and denote it by $D_2$. Thus if n = 3

$$D_2 = |(ab)_{ij}| = \begin{vmatrix} (ab)_{12} & (ac)_{12} & (bc)_{12} \\ (ab)_{13} & (ac)_{13} & (bc)_{13} \\ (ab)_{23} & (ac)_{23} & (bc)_{23} \end{vmatrix}.$$

To avoid ambiguity let the alphabetical and the ascending order
of letters and suffixes be maintained. If n = 4 there are six rows
and columns.

Similarly let $D_3 = |(abc)_{ijk}|$ be the third compound of $\Delta$,
denoting the corresponding determinant of order $\binom{n}{3}$. Thus if
n = 4 its leading diagonal is

$$(abc)_{123}\ (abd)_{124}\ (acd)_{134}\ (bcd)_{234}.$$

And so on until finally $\Delta$ itself is reached. The Cauchy-Sylvester
theorem is this:

Each determinant $D_i = |(ab \dots)_{jk\dots}|$ is a positive integral
power of $\Delta$, the power for the ith compound being $\binom{n-1}{i-1}$.

This is proved by considering an adjoint determinant whose
elements are co-factors, in the original determinant $\Delta$, of the
elements $(ab \dots)_{ij\dots}$.

¹ See Muir, History of Determinants, 1, 118.

Let be such co-factors. We 

form the two determinants 

I I 

and multiply them together, column by column, not row 
by column. The resulting inner products which go to form 
elements of the product determinant are all determinants of 
order n, because the inner products are actual Laplace develop- 
ments of these determinants. The leading diagonal determinants 
are all equal to A, and the others vanish as in the original 
Cauchy theorem. 

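To illustrate this let n = 5, $\Delta = (abcde)$ and the co-factor
determinants be $|(ab)_{ij}|$, $|(cde)_{klm}|$. Then

$$\begin{vmatrix} (ab)_{12} & (ac)_{12} & \dots \\ (ab)_{13} & (ac)_{13} & \dots \\ \dots & & \end{vmatrix} \times \begin{vmatrix} (cde)_{345} & (bde)_{345} & \dots \\ (cde)_{245} & (bde)_{245} & \dots \\ \dots & & \end{vmatrix} = \begin{vmatrix} (abcde) & (abbde) & \dots \\ (accde) & (acbde) & \dots \\ \dots & & \end{vmatrix} = \begin{vmatrix} \Delta & . & \dots \\ . & \Delta & \dots \\ \dots & & \end{vmatrix}.$$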

In general the result of multiplying these two determinants, each of
order $\binom{n}{i}$, is $\Delta^{\binom{n}{i}}$.

But $\Delta$ has no polynomial factors (§4, p. 33); hence $\Delta^{\binom{n}{i}}$ has
none except powers of $\Delta$. Accordingly both determinants on the left are powers
of $\Delta$, to a numerical factor. Taking the special case when
$\Delta$ is $|\delta_{ij}|$, the unit determinant, the numerical factor is seen to
be unity: and further, since, in the left-hand factor determinant,
the letter a enters into $\binom{n-1}{i-1}$ of the columns, and therefore into each
term in its expansion $\binom{n-1}{i-1}$ times, it follows that

$$D_i = \Delta^{\binom{n-1}{i-1}}.$$

The other factor must be $\Delta^{\binom{n-1}{i}}$, which is equal to $D_{n-i}$.
For by the theory of indices,

$$\binom{n}{i} = \binom{n-1}{i-1} + \binom{n-1}{i}.$$

We may now summarize this Cauchy-Sylvester theorem for all
compound determinants derived from a given determinant of
order n as follows:

$$D_1 = \Delta, \quad D_2 = \Delta^{\binom{n-1}{1}}, \quad D_3 = \Delta^{\binom{n-1}{2}}, \quad \dots, \quad D_n = \Delta.$$

Thus for n = 5 this set gives $\Delta$, $\Delta^4$, $\Delta^6$, $\Delta^4$, $\Delta$.
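The exponents in this summary can be checked for n = 4, where they should be 1, 3, 3, 1. The sketch below builds the kth compound as the array of all k-rowed minors; the function name `compound` and the matrix are assumptions for illustration, and rows and columns of the compound are interchanged relative to the text's arrangement, which does not affect its determinant.

```python
from itertools import combinations
from math import comb


def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    return sum(
        (-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
        for j in range(len(m))
    )


def compound(m, k):
    """Array of all k-rowed minors of the square matrix m, in ascending order of suffixes."""
    index = list(combinations(range(len(m)), k))
    return [[det([[m[i][j] for j in cs] for i in rs]) for cs in index] for rs in index]


# Hypothetical matrix of order n = 4.
M = [[2, 1, 0, 1],
     [1, 3, 1, 0],
     [0, 1, 2, 1],
     [1, 0, 1, 2]]
n = len(M)
delta = det(M)
checks = [det(compound(M, k)) == delta ** comb(n - 1, k - 1) for k in range(1, n + 1)]
print(checks)  # one entry for each of D_1, D_2, D_3, D_4
```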


EXAMPLES 

1. Prove the analogous reciprocal results for upper suffixes:

$|(ab)^{ij}| = 1/D_2$, $|(abc)^{ijk}| = 1/D_3$, &c.

2. Extend the Jacobi ratio theorem to cover the case of minors of
reciprocal adjugate determinants, e.g. $|(ab)_{ij}|$ and $|(ab)^{ij}|$.

10. The Generalized Ratio Theorem. 

Starting with the Jacobi ratio theorem (§3, p. 78), which the
quaternary case illustrates,

$$\frac{a_1}{|b^2c^3d^4|} = \frac{|a_1b_2|}{|c^3d^4|} = \dots = \frac{1}{|a^1b^2c^3d^4|},$$

we can adjoin further equal ratios as follows. Take any number
of arbitrary sets

$$\{x^1, x^2, \dots, x^n\}, \quad \{y^1, y^2, \dots, y^n\}, \quad \dots,$$

and multiply single-letter numerators of suffix i, together with
their denominators, by $x^i$; double-letter numerators $|a_ib_j|$,
together with their denominators, by $|x^iy^j|$; and so on. Then
sum the numerators, forming $\sum a_ix^i$, and sum their corresponding
denominators, $\sum x^i\,|b^jc^kd^l|$. Also sum $\sum |ab|\,|xy|$ and their
denominators. The result is to obtain further ratios equal to
the original, such as

$$\frac{(x \mid a)}{(xbcd)'}.$$

And if by (xbcd)' we understand this upper suffix determinant $|x^1b^2c^3d^4|$,
in contrast to (xbcd) which means $|x_1b_2c_3d_4|$, we have

$$|a_1b_2c_3d_4| = (abcd) = \frac{(x \mid a)}{(xbcd)'} = \frac{(x \mid b)}{(xcad)'} = \frac{(xy \mid ab)}{(xycd)'} = \dots = \frac{1}{(abcd)'}$$

for all values of x, y, . . . .
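Two of these adjoined ratios can be verified with exact fractions. The sketch assumes that the upper-suffix element $a^i$ is the co-factor of $a_i$ divided by $\Delta$, and that row i of the illustrative matrix `M` holds $a_i, b_i, c_i, d_i$. The sets `x` and `y` are arbitrary values.

```python
from fractions import Fraction


def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    return sum(
        (-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
        for j in range(len(m))
    )


def cofactor(m, i, j):
    minor = [row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i]
    return (-1) ** (i + j) * det(minor)


# Hypothetical non-singular matrix: suffix = row, letter = column.
M = [[2, 1, 0, 0],
     [1, 2, 1, 0],
     [0, 1, 2, 1],
     [0, 0, 1, 2]]
delta = det(M)
assert delta != 0
# Upper-suffix (reciprocal) elements, assumed to be co-factor / delta.
U = [[Fraction(cofactor(M, i, j), delta) for j in range(4)] for i in range(4)]

x = [3, -1, 4, 2]   # arbitrary set x^1 .. x^4
y = [1, 5, -2, 0]   # arbitrary set y^1 .. y^4
col = lambda j: [M[i][j] for i in range(4)]
dot = lambda u, v: sum(p * q for p, q in zip(u, v))

x_a = dot(x, col(0))                                           # (x | a)
xbcd_upper = det([[x[i]] + U[i][1:] for i in range(4)])        # (xbcd)'
xy_ab = dot(x, col(0)) * dot(y, col(1)) - dot(x, col(1)) * dot(y, col(0))  # (xy | ab)
xycd_upper = det([[x[i], y[i]] + U[i][2:] for i in range(4)])  # (xycd)'

print(delta, x_a == delta * xbcd_upper, xy_ab == delta * xycd_upper)
print(det(U) == Fraction(1, delta))                            # (abcd)' = 1 / (abcd)
```

The ratios are tested in the multiplied-out form so that a vanishing denominator, should the arbitrary sets produce one, causes no division error.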

A correlative statement, involving arbitrary sets $[u_1, u_2, \dots, u_n]$,
$[v_1, \dots, v_n]$, . . . with lower suffixes, follows in exactly the
same way:

$$(abcd) = \frac{(uvwa)}{(uvw \mid bcd)'} = \frac{(uvwb)}{(uvw \mid cad)'} = \frac{(uvab)}{(uv \mid cd)'} = \dots = \frac{1}{(abcd)'}.$$


11. Tensor Constants of the Fundamental Identities (pp. 44-48). 


In the fundamental identities certain matrices R, L, M, N 
enter. The columns of some are deranged from term to term, 
those of others maintain their relative position. We shall call 
these latter matrices the constants of the identity concerned, 
while the others, resolved into their ultimate columns, will be 
called the variable vectors or simply the variables. 

Thus, for example, in §8 (30), p. 43, which leads to identity
(31), namely to

$$(abcd)(efgh) = (defa)(bcgh), \qquad (16)$$

the constant is

$$[gh] = \begin{bmatrix} g_1 & h_1 \\ g_2 & h_2 \\ g_3 & h_3 \\ g_4 & h_4 \end{bmatrix}. \qquad (17)$$

Any identity arising from deranging a product of p such
determinants, as in §11, (I), (II), (III), p. 48, evidently has p − 1
constants M, N, . . . . They have respective currencies j, k, . . . .

Now consider the above example, with each second factor
developed by a Laplace expansion as

$$(efgh) = \sum (ef)_{pq}(gh)_{rs}, \qquad (18)$$

$$p, q, r, s = 1, 2, 3, 4.$$

Since (16) is an identity for all values of $g_i$, $h_i$, in particular, we
may put $g_r = h_s = 1$ with all other six elements g and h zero.
Hence

$$(abcd)(ef)_{pq} = (defa)(bc)_{pq}, \qquad (19)$$

where p, q are any two among 1, 2, 3, 4.

Now let a set of six arbitrary quantities be chosen

$$G = [G_{12}, G_{13}, \dots, G_{34}] = [G_{rs}]. \qquad (20)$$

Multiply (19) through by $G_{rs}$ and sum for the six sets of values
of pq, rs as given by (18), finally writing

(efG) for $\sum (ef)_{pq}G_{rs}$.

Then

$$(abcd)(efG) = (defa)(bcG). \qquad (21)$$

Here we have a slightly different form of the identity. It now
involves a constant set G defined by six elements as in (20), but
not necessarily by eight original elements as in (17).

In exactly the same way the identities (I), (II), (III), §11, p. 48,
may be treated. The constant matrix M of currency j may be
replaced by an arbitrary set of elements

$$M = [M_{12 \dots j}, \dots]. \qquad (22)$$

Similarly for the other possible constants in the identity.

And again, returning to identity (19), if we multiply each
member by an arbitrary quantity $H^{pq}$, taking

$$H = [H^{12}, H^{13}, \dots, H^{34}] = [H^{pq}], \qquad (23)$$

and if (ef | H) means $\sum (ef)_{pq}H^{pq}$, we may write the identity as

$$(abcd)(ef \mid H) = (defa)(bc \mid H). \qquad (24)$$

Thus (16), (19), (21), (24) are essentially the same identity as
far as the variables a, b, c, d, e, f are concerned.

Definition of Tensor. — For a given category n, a set of
quantities each carrying j suffixes, where the suffixes take all values 1, 2, . . . , n,
is called a tensor of order j. A vector is a tensor of order
unity.




12. Application of the Principle of Duality. 

Let us make formal duals (§13, p. 54) of the vectors a, b, c, . . . :
in other words we consider each vector to furnish a column of
a perfectly arbitrary n-rowed determinant, and take as formal
dual of a the set of n co-factors of the column $\{a_1, a_2, \dots, a_n\}$.

Then if this determinant is written $(a \mid \alpha)$, the set

$$\{\alpha^1, \alpha^2, \dots, \alpha^n\}$$

is formal dual of

$$\{a_1, a_2, \dots, a_n\}.$$

Now let Greek letters $\alpha$, $\beta$, $\gamma$ be formal duals of a, b, c. Then
there will be a compound n-rowed determinant

$$(\alpha\beta\gamma \dots \mu) = \sum \pm\, \alpha^1\beta^2\gamma^3 \dots \mu^n,$$

formally dual to

$$(abc \dots m) = \sum \pm\, a_1b_2c_3 \dots m_n.$$

Manifestly fundamental identities exist among such determinants.
Thus if n = 4 we have as dual of (16)

$$(\alpha\beta\gamma\delta)(\epsilon\zeta\eta\theta) = (\delta\epsilon\zeta\alpha)(\beta\gamma\eta\theta).$$

In particular as in (19)

$$(\alpha\beta\gamma\delta)(\epsilon\zeta)^{pq} = (\delta\epsilon\zeta\alpha)(\beta\gamma)^{pq}.$$

Hence if we use the natural notation $(\epsilon\zeta H)$ for $\sum (\epsilon\zeta)^{pq}H^{rs}$ and
$(\epsilon\zeta \mid G)$ for $\sum (\epsilon\zeta)^{pq}G_{pq}$, we have the following correlative with
(21) and (24),

$$(\alpha\beta\gamma\delta)(\epsilon\zeta H) = (\delta\epsilon\zeta\alpha)(\beta\gamma H),$$
$$(\alpha\beta\gamma\delta)(\epsilon\zeta \mid G) = (\delta\epsilon\zeta\alpha)(\beta\gamma \mid G).$$

The important thing to notice is that in every detail the original
identity is matched by a dual identity. They only differ in two
respects:

(i) A lower suffix becomes an upper suffix, and vice versa.

(ii) Variables are replaced by formal duals, as shown by
writing Greek for italic letters.



In exactly the same way any fundamental identity can be
reciprocated into a dual, and there are in fact eight different
modes (four direct and four dual) of writing such an identity.
In all these modes the variable vectors are permuted in precisely
the same way.

The case when n = 5 illustrates this well enough:

$$(abcde)(fghuv) = (abefg)(cdhuv),$$
$$(abcde)(fg)_{ij} = (abefg)(cd)_{ij},$$
$$(abcde)(fgP) = (abefg)(cdP),$$
$$(abcde)(fg \mid Q) = (abefg)(cd \mid Q),$$

where $P = [P_{123}, \dots, P_{345}]$, $Q = [Q^{12}, \dots, Q^{45}]$ are sets of
10 arbitrary constants.

$$(\alpha\beta\gamma\delta\epsilon)(\zeta\eta\theta\phi\psi) = (\alpha\beta\epsilon\zeta\eta)(\gamma\delta\theta\phi\psi),$$
$$(\alpha\beta\gamma\delta\epsilon)(\zeta\eta)^{ij} = (\alpha\beta\epsilon\zeta\eta)(\gamma\delta)^{ij},$$
$$(\alpha\beta\gamma\delta\epsilon)(\zeta\eta R) = (\alpha\beta\epsilon\zeta\eta)(\gamma\delta R),$$
$$(\alpha\beta\gamma\delta\epsilon)(\zeta\eta \mid S) = (\alpha\beta\epsilon\zeta\eta)(\gamma\delta \mid S),$$

where $R = [R^{123}, \dots, R^{345}]$, $S = [S_{12}, \dots, S_{45}]$
are sets of ten arbitrary constants.

13. The Sylvester Identity. 

We conclude this investigation of the principle of duality
by making a dual transformation of the Sylvester identity,
§9, (II), p. 45, which was stated in the form

$$(A_i\dot C_k)(\dot D_iF_k) = (D_iC_k)(A_iF_k), \qquad i + k = n. \qquad (25)$$

On the left are $\binom{n}{i}$ terms, due to derangement of k columns $C_k$
and i columns $D_i$. Without loss of generality we can assume
$i \le k$.

Among these terms on the left is $(A_iC_k)(D_iF_k)$, while in all
the rest some columns of $D_i$ appear in the same factor as $A_i$.
Let us rewrite this identity with this first term transferred to
the other side as

$$\sum_{q=1}^{i}\sum\,(A_iD_qC_{k-q})(D_{i-q}C_qF_k) = (A_iF_k)(D_iC_k) - (A_iC_k)(D_iF_k), \qquad (26)$$

where q denotes the number of columns of $D_i$ transferred. For
each value of q there are $\binom{i}{q}\binom{k}{q}$ terms, because this is the number
of choices which can be made from $D_i$ and $C_k$. The notation $A_iD_q$
denotes the matrix A combined with q of the columns of D.
Similarly for the other suffixes. The double summation sign is
used rather than the dot notation for reasons of convenience.

Example. — If n = 4, A = ab, D = cd, C = wt, F = uv,
then such an identity is

(abuv)(cdwt) − (abwt)(cduv)
= (abct)(dwuv) − (abcw)(dtuv)
− (abdt)(cwuv) + (abdw)(ctuv)
+ (abcd)(uvwt).

On the right, there are four terms answering to q = 1, due to
derangements of c, d and of t, w independently. We write these
as $(ab\dot c\bar t)(\dot d\bar wuv)$, with dots and bars to distinguish the two
determinantal permutations. So the identity now runs

$$(abuv)(cdwt) - (abwt)(cduv) = (ab\dot c\bar t)(\dot d\bar wuv) + (abcd)(uvwt). \qquad (28)$$


Since the eight columns a, b, . . . , v of these determinants
are quite arbitrary, let us take $u_i$, $v_i$, $w_i$, $t_i$ respectively as the
co-factors of $\xi^i$, $\eta^i$, $\zeta^i$, $\omega^i$ in the non-vanishing determinant
$(\xi\eta\zeta\omega)$. So from the table

u v w t
ξ η ζ ω

we deduce $u_1 = (\eta\zeta\omega)^{234}$, &c. Then by the
Jacobi ratio theorem $(uv)_{12} = (\zeta\omega)^{34}(\xi\eta\zeta\omega)$, &c. Consequently

$$(abuv) = (ab \mid \zeta\omega)(\xi\eta\zeta\omega), \qquad (abct) = -(abc \mid \xi\eta\zeta).$$

If these are substituted in (28) and the common factor $(\xi\eta\zeta\omega)^2$
is removed we obtain, as in §8, p. 89,

$$(ab \mid \zeta\omega)(cd \mid \xi\eta) - (ab \mid \xi\eta)(cd \mid \zeta\omega)$$
$$= -(ab\dot c \mid \xi\eta\zeta)(\dot d \mid \omega) + (ab\dot c \mid \xi\eta\omega)(\dot d \mid \zeta) + (abcd)(\xi\eta\zeta\omega)$$
$$= -(ab\dot c \mid \xi\eta\bar\zeta)(\dot d \mid \bar\omega) + (abcd)(\xi\eta\zeta\omega). \qquad (29)$$

This is a form of the Sylvester identity in terms of matrix inner
products. Since it is a polynomial identity which holds for all
values of the elements concerned provided $(\xi\eta\zeta\omega)$ does not vanish,
this last restriction may be removed as in the case of §4, p. 82.
The formula therefore holds without exception.

14.^ Formal Proof of the Sylvester Identity. 

More generally, by the same methods, we may transform
identity (25) to a relation between columns of four matrices
A, B, P, Q of the same currency. For if

$$A_i = a_1a_2 \dots a_i, \quad B_i = b_1b_2 \dots b_i, \quad P_i = x_1x_2 \dots x_i, \quad Q_i = y_1y_2 \dots y_i,$$

then

i • • ^ 

2 (ffli . . . a; 61 . . . 6 ,J Xi . . . x, 

X ( 6 ,^+ 1 ... 6,1 ^ 

= («i •••«.- 1 yi •• • yd {bi...bi\xi... X,) 

- (fli . . . a, I a:i . . . x.) (fej . . . 6 , | . . . y,). , 

This can be written more shortly as

=.{^,|P,)(B,1Q,)-(^,|Q,)(B,|P,). . (31) 

If 2i > n, the upper summation limit is n. 

Proof — 

The result follows from (26) by the Jacobi ratio theorem, 
exactly as in the quaternary case just considered, provided 

2i < n. 

* This section may be omitted on a first reading. 




We take 2i -t- j == n, and work with two dual matrices of order n 
[ a *! 0 ^ 2 ... 2 ^ 1 ... 2:^1 

[%?/2 • • • • . •'Wj ^1 • • • 

where the lower is the adjugate of the upper. For instance, the
adjugate of the minor determinant

is 

where A, A' are algebraic complementary suffix rows. Similarly 
that of is mv^ . . . v,^w ^ . . . 

Hence the series on the left of (31) is equal to 

S ( _ _ a; V . . . . 1 ?, u\... Wj) 

X . . . 6 , u, . . . UiV ^ . . .7;,^ . . . tv;). 

Since q^ — q is even, the sign simplifies to ( (“)^ This 

makes the series a Sylvester series as in §11 (26), p. 94, so that 
it is equal to the two terms 

(—)'{(«!• ••«{%•• -Mi «’i- • •«’;) 

X (&,,+! b,j v,i + , . . . Vi t\ . ..V, I w^... Wj) 

— («1 • • • a.- Wl . • • Wj) ( 6 , . . . 6 ; M, . . . M; W^... Wj) }• 

which simplify to 

• • • »«1 • . • Wl . • . ) . . . ) 

- (fll . . . Vl . . . . . .) ( 61 . . . ^fi . . . W, . . . )}. 

or finally to 

(«i • • • «i 1 2/1 • • • Vi) (bi...bi\xi... Xi) 

- (ai . . . (i,. I 3^1 . . . r,.) (61 . . . I . . . y,.). 

This proves the theorem, provided 2i ^ n. 

To prove it if 2i > n, we merely take the theorem for original
determinants of order 2i, where all the elements of the rows
n + 1, n + 2, . . . , n + (2i − n) are zero. For this automatically
turns the inner product $\sum a_k x_k$ summed for k = 1, 2, . . . , 2i




into one for k = 1, 2, . . . , n, so that any compound inner product
may be interpreted indifferently as of either category 2i
or n. It further ensures that rth compound inner products when
r > n should vanish identically. This is needed when n > 
in order to remove the terms of (31) for values of y between 
n — % and 

Example . — If in (29), p. 95, ^ ~ 0, the 

quaternary identity leads to the ternary identity 

{d) I Coj) (C(Z I fij) - {ab I ^rj) (cd I -- — (abc) drfl) {d \ w). 

For the third compound $(abc \mid \xi\eta\zeta)$ now resolves into factors
$(abc)(\xi\eta\zeta)$.

The reader will have no difficulty in finding the dual form of
the other fundamental types (I), (III) of §9, p. 45, by making
analogous transformations.


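EXAMPLES

1. If n > 3, prove

$$(bc \mid \eta\zeta)(a \mid \xi) + (ca \mid \eta\zeta)(b \mid \xi) + (ab \mid \eta\zeta)(c \mid \xi) = (abc \mid \xi\eta\zeta),$$

and adapt the result to the binary and ternary cases, n = 2, 3.

2. If n > 4,

$$(bc \mid \eta\zeta)(ad \mid \xi\omega) + (ca \mid \eta\zeta)(bd \mid \xi\omega) + (ab \mid \eta\zeta)(cd \mid \xi\omega) = (abc \mid \xi\eta\zeta)(d \mid \omega) - (abc \mid \omega\eta\zeta)(d \mid \xi).$$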





CHAPTER VI 


Special Types of Determinant 

1. Properties of Matrices and Determinants connected with the 
Leading Diagonal. 

Associated with every square matrix A of order n is the
matrix $\lambda I - A$ obtained by subtracting λ from each element of
the leading diagonal, and changing all the signs. The equation
in λ obtained by equating to zero the corresponding determinant
$|\lambda I - A|$ is called the characteristic equation of the matrix A.
For example, if n = 3, and $A = [a_{ij}]$,

$$f(\lambda) \equiv |\lambda I - A| = \begin{vmatrix} \lambda - a_{11} & -a_{12} & -a_{13} \\ -a_{21} & \lambda - a_{22} & -a_{23} \\ -a_{31} & -a_{32} & \lambda - a_{33} \end{vmatrix} = 0.$$

This equation is a cubic in λ, and in general the characteristic
equation is of order n, as is apparent by writing down the leading
term in the expansion of this determinant. So

$$f(\lambda) = a_0\lambda^n + a_1\lambda^{n-1} + \dots + a_r\lambda^{n-r} + \dots + a_n,$$

where $a_0 = 1$ and each $a_r$ is a polynomial function of the elements $a_{ij}$.
The n roots $\lambda_1, \lambda_2, \dots, \lambda_n$ of this equation $f(\lambda) = 0$ are called the
latent roots of the matrix.

As it is useful to know how to perform the expansion of this 
characteristic determinant, a method is suggested by the follow- 
ing examples. 

EXAMPLES

1. Prove

$$
\begin{vmatrix} a_1 + x & b_1 & c_1 & d_1 \\ a_2 & b_2 + x & c_2 & d_2 \\ a_3 & b_3 & c_3 + x & d_3 \\ a_4 & b_4 & c_4 & d_4 + x \end{vmatrix}
$$

$$
\begin{aligned}
= x^4 &+ (a_1 + b_2 + c_3 + d_4)x^3 \\
&+ \bigl[\,|a_1b_2| + |a_1c_3| + |a_1d_4| + |b_2c_3| + |b_2d_4| + |c_3d_4|\,\bigr]x^2 \\
&+ \bigl[\,|a_1b_2c_3| + |a_1b_2d_4| + |a_1c_3d_4| + |b_2c_3d_4|\,\bigr]x + |a_1b_2c_3d_4|.
\end{aligned}
$$

[Differentiate both sides of the identity with regard to x four times.
Put x = 0 at each stage.]
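The expansion by the leading diagonal lends itself to a quick numerical check. The sketch below assumes an arbitrary fourth order matrix of integers and compares the determinant with x added to each diagonal element against the polynomial whose coefficients are the sums of principal minors; the names `principal_minor_sums` and `shifted_det` are illustrative.

```python
from itertools import combinations, permutations

def det(m):
    """Determinant as a signed sum over permutations (the empty determinant is 1)."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        term = 1
        for i in range(n):
            for j in range(i + 1, n):
                if perm[i] > perm[j]:
                    term = -term
        for row, col in enumerate(perm):
            term *= m[row][col]
        total += term
    return total

def principal_minor_sums(a):
    """c[k] = sum of all k-rowed principal minors of a, with c[0] = 1."""
    n = len(a)
    return [sum(det([[a[i][j] for j in idx] for i in idx])
                for idx in combinations(range(n), k))
            for k in range(n + 1)]

def shifted_det(a, x):
    """Determinant of a with x added to every leading-diagonal element."""
    n = len(a)
    return det([[a[i][j] + (x if i == j else 0) for j in range(n)]
                for i in range(n)])

def expansion_holds(a, xs):
    n, c = len(a), principal_minor_sums(a)
    return all(shifted_det(a, x) == sum(c[k] * x ** (n - k) for k in range(n + 1))
               for x in xs)

a = [[2, -1, 0, 3], [1, 4, -2, 0], [0, 5, 1, -1], [3, 0, 2, -3]]
print(expansion_holds(a, range(-3, 4)))
```

Seven values of x are more than enough to fix a quartic, so agreement at all of them confirms the identity for this matrix.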

2. Generalize this result. 




3. If $p_1, p_2, \dots, p_n$ are the leading diagonal elements of a determinant
Δ, and $P, P_i, P_{ij}, P_{ijk}, \dots$ denote the values (after putting $p_1 = p_2 = \dots = 0$)
of Δ, the co-factor of $p_i$, that of $p_ip_j$, &c., show that Δ can be expressed in
the form

$$P + \sum P_i p_i + \sum P_{ij}\, p_i p_j + \dots + p_1 p_2 \dots p_n.$$

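4. Prove

$$
\begin{vmatrix} p_1 & b_1 & c_1 \\ a_2 & p_2 & c_2 \\ a_3 & b_3 & p_3 \end{vmatrix}
= \begin{vmatrix} . & b_1 & c_1 \\ a_2 & . & c_2 \\ a_3 & b_3 & . \end{vmatrix}
- b_3c_2\,p_1 - a_3c_1\,p_2 - a_2b_1\,p_3 + p_1p_2p_3.
$$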


2. The Cayley Hamilton Theorem. 

For a second order matrix the characteristic equation is a
quadratic. Thus if $A = \begin{bmatrix} a & b \\ c & d \end{bmatrix}$,

$$f(\lambda) = \begin{vmatrix} \lambda - a & -b \\ -c & \lambda - d \end{vmatrix} = \lambda^2 - (a + d)\lambda + ad - bc = 0.$$

If we construct the corresponding function f(A) of the matrix
A itself by evaluating

$$A^2 - (a + d)A + (ad - bc)I$$

as a second order matrix, we obtain the remarkable result that
all the elements of this matrix are zero. This can readily be
verified. It is an instance of an important theorem which runs
as follows.

The Cayley Hamilton Theorem. — Every square matrix 
satisfies its own characteristic equation. 

Proof . — 

Let the matrix $\lambda I - A$, constructed from a given matrix
A of order n, have for its adjugate (§8, p. 70) B. Since the
elements of $\lambda I - A$ are at most of the first degree in λ, their
co-factors in $|\lambda I - A|$ are at most of degree n − 1 in λ. Hence
we write the typical element of the adjugate as

$$b_0 + b_1\lambda + \dots + b_{n-1}\lambda^{n-1},$$

where the n coefficients $b_i$ are polynomial functions of the
elements of A.

Thus the matrix B itself may be written as

$$B_0 + B_1\lambda + \dots + B_{n-1}\lambda^{n-1},$$

where $B_i$ is a matrix whose typical element is $b_i$.





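But since $|\lambda I - A| \equiv f(\lambda)$ is the determinant of $\lambda I - A$,
it follows by direct multiplication (§8, (11), p. 70) that

$$(\lambda I - A)B = |\lambda I - A|\,I = f(\lambda)\,I.$$

Hence

$$(\lambda I - A)(B_0 + B_1\lambda + \dots + B_{n-1}\lambda^{n-1}) = f(\lambda)\,I,$$

or

$$\lambda\sum_r B_r\lambda^r - A\sum_r B_r\lambda^r = (a_0\lambda^n + a_1\lambda^{n-1} + \dots + a_n)\,I.$$

This is an identity for all values of λ. Equating coefficients of
$\lambda^n, \lambda^{n-1}, \dots, \lambda^1, \lambda^0$ we obtain

$$
\begin{aligned}
B_{n-1} &= a_0 I, \\
B_{n-2} - AB_{n-1} &= a_1 I, \\
&\;\;\vdots \\
B_0 - AB_1 &= a_{n-1} I, \\
-AB_0 &= a_n I.
\end{aligned}
$$

By fore multiplication with $A^n, A^{n-1}, \dots, A, I$ respectively and
addition, we obtain

$$0 = a_0A^n + a_1A^{n-1} + \dots + a_{n-1}A + a_nI,$$

which proves the theorem.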
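The steps of this proof can be followed numerically. The sketch below is an illustration only: it assumes the coefficients $a_k$ are obtained as $(-1)^k$ times the sum of the k-rowed principal minors of A (a standard fact, not the method of the text), builds $B_{n-1}, \dots, B_0$ from the equated coefficients, and then tests the last relation $-AB_0 = a_nI$. All function names are hypothetical.

```python
from itertools import combinations, permutations

def det(m):
    """Determinant as a signed sum over permutations (the empty determinant is 1)."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        term = 1
        for i in range(n):
            for j in range(i + 1, n):
                if perm[i] > perm[j]:
                    term = -term
        for row, col in enumerate(perm):
            term *= m[row][col]
        total += term
    return total

def char_coeffs(a):
    """a_0, ..., a_n of f(x) = |xI - A|, from sums of principal minors."""
    n = len(a)
    return [(-1) ** k * sum(det([[a[i][j] for j in idx] for i in idx])
                            for idx in combinations(range(n), k))
            for k in range(n + 1)]

def mat_mul(p, q):
    n = len(p)
    return [[sum(p[i][k] * q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add_scalar(p, c):
    """The matrix p + c I."""
    n = len(p)
    return [[p[i][j] + (c if i == j else 0) for j in range(n)] for i in range(n)]

def adjugate_coefficients(a):
    """B_0, ..., B_{n-1}, where adj(xI - A) = B_0 + B_1 x + ... + B_{n-1} x^(n-1)."""
    n = len(a)
    coeffs = char_coeffs(a)
    zero = [[0] * n for _ in range(n)]
    b = [None] * n
    b[n - 1] = add_scalar(zero, coeffs[0])              # B_{n-1} = a_0 I
    for k in range(n - 1, 0, -1):
        # B_{k-1} - A B_k = a_{n-k} I
        b[k - 1] = add_scalar(mat_mul(a, b[k]), coeffs[n - k])
    return coeffs, b

def satisfies_own_equation(a):
    n = len(a)
    coeffs, b = adjugate_coefficients(a)
    residual = add_scalar(mat_mul(a, b[0]), coeffs[n])  # A B_0 + a_n I
    return all(v == 0 for row in residual for v in row)

singular = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
print(det(singular), satisfies_own_equation(singular))
```

The sample matrix is singular, which illustrates that the theorem does not depend on A having a reciprocal.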

It is to be noted that the theorem holds for singular matrices. 

There are alternative ways of writing this result. Since we
can factorize the expression f(λ) in terms of its latent roots $\lambda_i$
as

$$f(\lambda) = a_0(\lambda - \lambda_1)(\lambda - \lambda_2)\dots(\lambda - \lambda_n),$$

we can therefore also write

$$f(A) = a_0(A - \lambda_1I)(A - \lambda_2I)\dots(A - \lambda_nI),$$

where the order of factors is immaterial (cf. Ex. 10, p. 72). Here
is a case where, by the Cayley Hamilton theorem, the product of
n matrices $A - \lambda_iI$ is zero, though we cannot assume that any
of the factors are zero.





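EXAMPLES

1. If Λ is the diagonal matrix

$$\begin{bmatrix} \lambda_1 & . & . \\ . & \lambda_2 & . \\ . & . & \lambda_3 \end{bmatrix}$$

prove that Λ also satisfies the characteristic equation f(λ) = 0 of the matrix A.

2. If B is an arbitrary non-singular matrix, prove that $B\Lambda B^{-1}$ and
$B^{-1}\Lambda B$ both satisfy f(λ) = 0.

3. Show that this is true for a square matrix of any order.

4. Prove the latent roots of the reciprocal of the third order matrix A
are $\lambda_1^{-1}, \lambda_2^{-1}, \lambda_3^{-1}$.

5. If the latent roots of A are λ, λ, μ, prove that f(λ) = 0 is satisfied
by

$$\Lambda = \begin{bmatrix} \lambda & 1 & . \\ . & \lambda & . \\ . & . & \mu \end{bmatrix}.$$

6. Verify that

$$\Lambda = \begin{bmatrix} \lambda & 1 & . \\ . & \lambda & 1 \\ . & . & \lambda \end{bmatrix}$$

satisfies f(λ) = 0 if the three roots are equal.

7. Show that $B\Lambda B^{-1}$ and $B^{-1}\Lambda B$ also satisfy f(λ) = 0, in 6.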


3. Special Types of Determinant.

Bordered determinants. Symmetric and skew symmetric deter- 
minants. 

If above and to the left of a square matrix $[a_{ij}]$ of order n
we add a row and column

$$\begin{matrix} 0 & u_1 & u_2 & \dots & u_n \\ v_1 \\ v_2 \\ \vdots \\ v_n \end{matrix}$$

we obtain a bordered matrix of order n + 1. Its determinant is
also said to be bordered and is written shortly as 


So if n = 3 an example is

$$\begin{vmatrix} . & u_1 & u_2 & u_3 \\ v_1 & a_1 & b_1 & c_1 \\ v_2 & a_2 & b_2 & c_2 \\ v_3 & a_3 & b_3 & c_3 \end{vmatrix}.$$




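We may border more deeply by adding a double row and column
meeting in a set of four zeros. We should write

$$\Sigma_2 = \begin{vmatrix} . & . & u_1 & u_2 & u_3 \\ . & . & u_1' & u_2' & u_3' \\ v_1 & v_1' & a_1 & b_1 & c_1 \\ v_2 & v_2' & a_2 & b_2 & c_2 \\ v_3 & v_3' & a_3 & b_3 & c_3 \end{vmatrix}.$$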

Such a process can be generalized, giving what may be called
bordered determinants of the first, second, . . . rth orders derived
from a nucleus Δ. Hence if $\theta_r$, $\phi_r$ each denote matrices of n rows
and r columns, or briefly matrices of currency r, and if as usual
the accent indicates transposition, then we can write the general
bordered determinant derived from

$$\Delta = |a_{ij}|$$

as

$$\Sigma_r = \begin{vmatrix} . & \theta_r' \\ \phi_r & a_{ij} \end{vmatrix}.$$

Now consider values of r between 0 and n, of which the above
case $\Sigma_2$ is typical. Expand $\Sigma_r$ by the Laplace development in terms of its first r rows.
For $\Sigma_2$ (n = 3), the result is three terms

$$(uu')_{23}(vv'a)_{123} + (uu')_{31}(vv'b)_{123} + (uu')_{12}(vv'c)_{123}.$$

Expand each factor $(vv'a)$ by the (r : n − r) development and the
result is linear in the set

$$(vv')_{23}, \quad (vv')_{31}, \quad (vv')_{12}.$$

We infer that $\Sigma_r$ is bilinear in the two sets of determinants of
the border matrices $\theta_r'$, $\phi_r$.

If r = n the same argument shows that the bordered deter- 
minant is merely the product 

better written 

(-)"|w.v! hoi- 

If r = 0, $\Sigma_0$ is Δ. If r > n, $\Sigma_r = 0$.

4. Reciprocation of Bordered Determinants. 

Bordered determinants obey the principle of duality in a 





manner which recalls the Jacobi ratio theorem. In fact the
following results may be regarded as a corollary of this theorem.
For consider three square matrices of order n



“(Xj 61 ... 7 n{ 


( 

1 


Vx ■■■ 

A- 

(12 &2 • • • '^^2 


3^2 y-i ••• k 

^ 

, )— r 

^2 V2 • ’ • ^2 


_a„ . . . )»„. 


J^n Va • • • 


Vn * * * 


together with their reciprocals with raised suffixes. For brevity 
take /A -- 3 , so that, as before, | ^ | | 62^3 while 

I X I I (i/gZs j, &c., and | S | = | ], &c. Then the 

following identities will hold: 


• 

• 

Xx 

a-2 

3-3 

• 

• 

yx 

«/2 

ys 

. 

. 

2l 

Z 2 

^S 


Vi 


«1 

h 

Cx 

V2 


«2 

h 


^3 


^3 

(l^ 

h 

*^S 


• 

• 

yx 

yz 

ys 


• 

• 

h 

2=2 

k 




<h 

bx 

k 


V2 

C 2 

®2 

bs 

<•2 




as 

bs 



, 

h 

Z2 

23 


Cx 

(lx 

bx 

Cl 


C 2 

a.^ 

bs 

C 2 


u 

as 

bs 

C 3 


«1 

^1 

Cx 


62 

C2 

^3 

bs 

C3 



61 

cl 





62 

C 2 

) 



a® 

63 

C3 





a;i 

a ;2 

a^ 




ai 

61' 

cl 




ffl 2 

62 

c 2 

) 


|3 

a® 

63 

C3 





a;! 

a :2 

a-3 


• 

. 

y^ 

y^ 

y3 


e 

V 

fli 

61 

cl 

> 

e 

7)2 

a 2 

62 

C 2 


e 

7,3 

«3 

63 

C3 


, 

, 

, 

xl 


»3 

. 

. 

. 

/ 

t 


. 

. 

. 

zl 

22 

2 ® 

e 

7)1 


ai 

61 

cl 

e 



ffl 2 

62 

c 2 

e 

7,3 

^3 

a® 

63 

C® 


where /> = — [ \ \ | | fiiyaCa I- 




The proof is immediate, by expanding the left-hand side
determinants as bilinear functions of the border matrices, and
then raising the suffixes by use of the Jacobi ratio theorem.

For instance, in the third identity, a typical term involving
H is So 1 ao ^3 I* theorem 

|«,C3|--!ai62^‘3KA 

whence 

2^2 1 «2 ^*3 H P 1 

agreeing with the corresponding term on the right. 

In general for n-rowed matrices A, X, ‘E we have 

p==(~-r\A\ ui |H|, 

and the letters absent in the borders of determinants on one side 
of the identities are present on the other, their arrangement being 
determined by the algebraic complement rule as in Jacobi’s 
theorem. 


5. Bordered Adjugate Determinant. 

As in the Jacobi theorem itself it is useful to recognize the
earlier form of the theorems of the last paragraph. Namely,
when all elements with upper suffixes are replaced by capital
letters denoting co-factors of elements with lower suffixes, the
theorems hold if ρ is multiplied by suitable positive integral
powers of the determinants concerned.


6. Symmetrical Matrices and Determinants. 

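These are symmetrical if transposition of rows into columns
makes no difference, so that $a_{ij} = a_{ji}$.

For example,

$$
\begin{bmatrix} a & h \\ h & b \end{bmatrix}, \qquad
\begin{bmatrix} a & h & g \\ h & b & f \\ g & f & c \end{bmatrix}, \qquad
\begin{bmatrix} a & h & g & u \\ h & b & f & v \\ g & f & c & w \\ u & v & w & d \end{bmatrix}.
$$

The condition that a matrix A should be symmetrical can be
written A = A'.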





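EXAMPLES

1. If capital letters denote co-factors of corresponding small letters
in

$$\Delta = \begin{vmatrix} a & h & g \\ h & b & f \\ g & f & c \end{vmatrix},$$

prove

$$\begin{vmatrix} . & u & v & w \\ u & a & h & g \\ v & h & b & f \\ w & g & f & c \end{vmatrix} = -(Au^2 + Bv^2 + Cw^2 + 2Fvw + 2Gwu + 2Huv),$$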


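$$\begin{vmatrix} . & . & u & v & w \\ . & . & u' & v' & w' \\ u & u' & a & h & g \\ v & v' & h & b & f \\ w & w' & g & f & c \end{vmatrix} = ax^2 + by^2 + cz^2 + 2fyz + 2gzx + 2hxy,$$

where x, y, z denote the determinants $vw' - v'w$, $wu' - w'u$, $uv' - u'v$ of the array

$$\begin{Vmatrix} u & v & w \\ u' & v' & w' \end{Vmatrix}.$$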


Expand as a quadratic in the same way

$$\begin{vmatrix} . & x & y & z \\ x & A & H & G \\ y & H & B & F \\ z & G & F & C \end{vmatrix}.$$

Answer: $-\Delta(ax^2 + by^2 + \text{etc.})$.

2. If A is an arbitrary square matrix, and A' is its transposed, then
AA' is symmetrical, and so is A'A.

[Use the fundamental relation of type (BC)' = C'B'.]


7. Skew Symmetric Determinants. 

$\Delta = |a_{ij}|$ is skew symmetric if $a_{ij} = -a_{ji}$, so that on the
leading diagonal every element is zero, since $a_{ii} = -a_{ii}$.

In this case the matrix $[a_{ij}]$ is said to alternate in its double
suffixes: interchange of suffixes is accompanied by change of
sign. So interchange of the set of rows and the set of columns
is accompanied by n changes of sign, one for each row, n being
the order of the determinant. Thus if A is the matrix, and A'
its transposed,

$$A = -A', \quad \text{but } |A| = (-)^n|A'|,$$

whence $\Delta = (-)^n\Delta$. Accordingly we have the theorem:

If n is odd, Δ is identically zero.




A skew symmetric matrix or determinant is completely
specified by the $\tfrac{1}{2}n(n - 1)$ elements in the triangle above its
diagonal. Thus if n = 3, we might have



If, however, n is even, Δ is a perfect square function of its
elements.

For let

$$\Delta_n = \begin{vmatrix} . & a & b & c & \dots \\ -a & . & d & e & \dots \\ -b & -d & . & f & \dots \\ -c & -e & -f & . & \dots \\ \dots & \dots & \dots & \dots & \dots \end{vmatrix}.$$

Consider the co-factors of the leading four elements $\begin{matrix} . & a \\ -a & . \end{matrix}$.
Since the diagonal co-factors are manifestly skew symmetric of
order n − 1, which is odd, they vanish. Also if A is co-factor of
a, that of −a is the transposed of A with every sign changed.
Hence it is $(-)^{n-1}A = -A$. The determinant of these four
co-factors is therefore

$$\begin{vmatrix} 0 & A \\ -A & 0 \end{vmatrix} = A^2.$$

But by the Jacobi ratio theorem this can be written

$$\Delta_n \begin{vmatrix} . & f & \dots \\ -f & . & \dots \\ \dots & \dots & \dots \end{vmatrix} = \Delta_n\Delta_{n-2},$$

say, where $\Delta_{n-2}$ is also an even skew symmetric determinant.
Thus if $\Delta_{n-2}$ is a perfect square, so is $\Delta_n$. But $\Delta_2$ obviously is
so. Hence by induction so is $\Delta_n$, or else it is zero. It is not
zero in general, since in the special case when b, c, . . . all
vanish except a, f, . . . , $\Delta_n = a^2f^2\dots$, where a, f, . . . are the
letters occurring alternately in the positions nearest the leading
diagonal.
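The fourth order case makes the argument concrete. The sketch below assumes the usual explicit square root $af - be + cd$ for the fourth order skew determinant (a known result, not stated in the text) and also checks the step $A^2 = \Delta_4\Delta_2$ with $\Delta_2 = f^2$; the sample values and function names are illustrative.

```python
from itertools import permutations

def det(m):
    """Determinant as a signed sum over permutations (adequate for small orders)."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        term = 1
        for i in range(n):
            for j in range(i + 1, n):
                if perm[i] > perm[j]:
                    term = -term
        for row, col in enumerate(perm):
            term *= m[row][col]
        total += term
    return total

def skew4(a, b, c, d, e, f):
    """Fourth order skew symmetric array with a, b, c, d, e, f above the diagonal."""
    return [[0, a, b, c], [-a, 0, d, e], [-b, -d, 0, f], [-c, -e, -f, 0]]

def cofactor(m, i, j):
    sub = [[m[r][c] for c in range(len(m)) if c != j]
           for r in range(len(m)) if r != i]
    return (-1) ** (i + j) * det(sub)

a, b, c, d, e, f = 2, -3, 5, 7, 1, -4
m = skew4(a, b, c, d, e, f)
big_a = cofactor(m, 0, 1)                            # co-factor A of the element a

print(det(m) == (a * f - b * e + c * d) ** 2)        # perfect square
print(cofactor(m, 1, 0) == -big_a)                   # co-factor of -a is -A
print(cofactor(m, 0, 0) == 0)                        # odd order skew co-factor vanishes
print(big_a ** 2 == det(m) * f ** 2)                 # A^2 = Delta_4 * Delta_2
print(det([[0, 5, -3], [-5, 0, 2], [3, -2, 0]]) == 0)  # odd order
```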

8. Characteristic Function of a Skew Matrix. 

Let a determinant be expanded by its principal diagonal as
in §1. In particular if the determinant

$$\begin{vmatrix} \lambda & c & -b \\ -c & \lambda & a \\ b & -a & \lambda \end{vmatrix}$$

is so expanded, the result is

$$(a^2 + b^2 + c^2)\lambda + \lambda^3.$$

Suppose S is a skew symmetric matrix and $|\lambda I + S|$ is the
corresponding determinant with λ replacing each zero in the
leading diagonal, as in the above example for the third order
case. Expanding by its leading diagonal in an ascending series
for λ we have

$$|\lambda I + S| = P + \sum P_i\lambda + \sum P_{ij}\lambda^2 + \dots + \lambda^n,$$

where $P_i$ is the co-factor of the ith diagonal element in |S|,
and $P_{ij}$ that of the product of the ith and jth diagonal elements
in |S|, and so on. But all such co-factors are skew symmetric;
hence those of odd order vanish, and those of even order are
perfect squares. Thus reversing the terms of the series, we have

$$|\lambda I + S| = \lambda^n + Q\lambda^{n-2} + R\lambda^{n-4} + \dots,$$

where Q, R, . . . are sums of squares and therefore are essentially
positive if the elements a, b, c, . . . of S are real.

Hence the matrix $\lambda I + S$ cannot be singular if its elements
are real, as long as λ > 0. In particular $I \pm S$ gives two non-singular
matrices.

9. Summary of Theorems "on Compound Determinants. 

In spite of the great intrinsic interest of the subject, and the 
wonderful flexibility of determinants as practical working tools 
in many branches of pure and applied mathematics, there is 
still a considerable absence of systematic knowledge of even the 
main results in the theory. It may therefore be of help to the 




reader to have a short statement of at any rate one main branch 
of what is indeed a very wide subject. 

We may sum up¹ the theory of compound determinants in
eight related theorems. These appear in their relative positions
most clearly if a numerical notation is adopted, where digits
have the significance of letters in what has preceded, and in
addition a group of less than n digits indicates a certain minor.

I. ((234)(134)(124)(123)) = (1234)³.

Cauchy’s theorem on the adjugate, 1812: The adjugate is the
(n − 1)th power of the original determinant (§7, p. 67).

II. ((134)(124)(123)) = (1234)²(1).

Jacobi’s theorem on the adjugate, 1831: A minor of order r of
the adjugate is equal to the complementary minor in the original
determinant Δ multiplied by the (r − 1)th power of Δ.

III. ((12)(13)(14)(23)(24)(34)) = (1234)³.


Sylvester’s theorem on the mth compound, 1851: The mth
compound of a given determinant Δ is the $\binom{n-1}{m-1}$th power of Δ
(§9, p. 87).
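Theorem III is easy to test on a small case. The sketch below assumes the mth compound is formed from all m-rowed minors with row sets and column sets taken in the same (lexicographic) order; the matrix and the name `compound` are illustrative choices.

```python
from itertools import combinations, permutations
from math import comb

def det(m):
    """Determinant as a signed sum over permutations (adequate for small orders)."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        term = 1
        for i in range(n):
            for j in range(i + 1, n):
                if perm[i] > perm[j]:
                    term = -term
        for row, col in enumerate(perm):
            term *= m[row][col]
        total += term
    return total

def compound(a, m):
    """Array of all m-rowed minors of a, rows and columns indexed alike."""
    n = len(a)
    index_sets = list(combinations(range(n), m))
    return [[det([[a[i][j] for j in cols] for i in rows]) for cols in index_sets]
            for rows in index_sets]

def sylvester_holds(a, m):
    """det of the m-th compound equals det(a) to the power C(n-1, m-1)."""
    n = len(a)
    return det(compound(a, m)) == det(a) ** comb(n - 1, m - 1)

a = [[2, 1, 0, 3], [1, -1, 2, 0], [0, 4, 1, 1], [3, 0, -2, 1]]
for m in (1, 2, 3, 4):
    print(m, sylvester_holds(a, m))
```

For n = 4 and m = 2 the compound is of order 6 and the exponent is 3, as in the numerical statement of III.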


IV. ((14)(23)(24)(34)) = (1234)((34)(24)).

Franke’s theorem on the mth compound, 1862: A minor of
order r of the mth compound is equal to the complementary minor in
the adjugate (n − m)th compound multiplied by the $\left\{r - \binom{n-1}{m}\right\}$th
power of the original determinant.

V. ((a23)(1b3)(12c)) = (123)²(abc).

Bazin’s theorem, 1854: If the determinants obtained by replacing
a column of A in all possible ways by a column of B are elements
of a "hybrid" compound determinant, the latter is equal to $A^{n-1}B$
(p. 56, Ex. 8).

VI. ((1b34)(12c4)(123d)) = (1234)²(1bcd).

Reiss’ theorem, 1867: Any minor of the Bazin hybrid compound
of A and B is equal to the complementary minor in the reciprocal
hybrid (i.e. that in which the roles of A and B are interchanged)
multiplied by a power $A^{r-1}$ of A.

¹ I owe the following illuminating summary to Dr. A. C. Aitken.

VII. ((ab34)(a2c4)(a23d)(1bc4)(1b3d)(12cd)) = (1234)³(abcd)³.

Reiss’ theorem, 1867 (Picquet, 1878): The hybrid compound of
A and B whose elements are the determinants obtained by replacing
in all possible ways m columns of A by m columns of B is equal
to $A^{\binom{n-1}{m}} B^{\binom{n-1}{m-1}}$.

Bazin’s theorem is the case m = 1.

VIII. ((afe34)(a2c4)(a23rf)(16c4))= (1234)(«fecrf) ((a2c^^ 

Reiss’ theorem, 1867 (Picquet, 1878): Any minor of the Reiss
hybrid compound is equal to the complementary minor in the
reciprocal Reiss hybrid, multiplied by

Theorem VI is the case when m = 1.

The above are the eight chief results in their actual order of 
discovery. Theorems I, II, III, V alone have been proved in the 
preceding pages, but the others can be dealt with by the same 
methods. 

Theorem II in the notation of p. 52 would follow from 
(134 . 124 . 123 . joyr) = {l2M)^(pqri) 

by decomposing the first matrix 134. Jacobi’s theorem then 
follows by equating the various coefficients of (pgr)fjf^. on both 
sides of this identity. This in fact gives another proof for what
has been called the Jacobi ratio theorem in §3, p. 78. 

We may very properly write 

I: II:: III: IV 


to show the relation of the first four of these theorems. 



CHAPTER VII 

Differentiation of a Determinant 


1. The Polarizing Process. 

When the general n-rowed determinant

$$\Delta = |e_{ij}| = |a_1b_2c_3 \dots m_n|$$

is regarded as a function of its n² different elements, treated
as independent variables, it yields the result $\dfrac{\partial\Delta}{\partial e_{ij}} = E_{ij}$, where
$E_{ij}$ is the co-factor of $e_{ij}$. This is simply because Δ is a linear
function in the single quantity $e_{ij}$.

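Again, since

$$\Delta = a_1A_1 + a_2A_2 + \dots + a_nA_n, \tag{1}$$

and $A_i = \partial\Delta/\partial a_i$, it follows that

$$\Delta = a_1\frac{\partial\Delta}{\partial a_1} + a_2\frac{\partial\Delta}{\partial a_2} + \dots + a_n\frac{\partial\Delta}{\partial a_n}, \tag{2}$$

which may be abbreviated to

$$\Delta = \left(a \,\Big|\, \frac{\partial\Delta}{\partial a}\right) = \left(a \,\Big|\, \frac{\partial}{\partial a}\right)\Delta, \tag{3}$$

the latter introducing the notation which separates the differential
operator from its operand. In such a case it must clearly be
noted that a and $\dfrac{\partial}{\partial a}$ are not commutative.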


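For instance, if n = 2,

$$\left(a \,\Big|\, \frac{\partial}{\partial a}\right)\Delta = a_1\frac{\partial\Delta}{\partial a_1} + a_2\frac{\partial\Delta}{\partial a_2} = \Delta,$$

whereas

$$\left(\frac{\partial}{\partial a} \,\Big|\, a\right)\Delta = \frac{\partial}{\partial a_1}(a_1\Delta) + \frac{\partial}{\partial a_2}(a_2\Delta) = \Delta + a_1\frac{\partial\Delta}{\partial a_1} + \Delta + a_2\frac{\partial\Delta}{\partial a_2} = 3\Delta. \tag{4}$$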




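The identity (2) is a particular case of Euler’s theorem for a
homogeneous function of degree s in its variables. Thus if
$f(x_1, \dots, x_n)$ is such a function

$$\left(x \,\Big|\, \frac{\partial}{\partial x}\right)f = x_1\frac{\partial f}{\partial x_1} + \dots + x_n\frac{\partial f}{\partial x_n} = sf. \tag{5}$$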

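The determinant may equally well be differentiated following
any row or column. Accordingly in the double suffix notation
we have results analogous to (2), such as

$$\Delta = e_{1j}\frac{\partial\Delta}{\partial e_{1j}} + e_{2j}\frac{\partial\Delta}{\partial e_{2j}} + \dots + e_{nj}\frac{\partial\Delta}{\partial e_{nj}}, \tag{6}$$

$$\Delta = e_{i1}\frac{\partial\Delta}{\partial e_{i1}} + e_{i2}\frac{\partial\Delta}{\partial e_{i2}} + \dots + e_{in}\frac{\partial\Delta}{\partial e_{in}}. \tag{7}$$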

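Again, since

$$b_1A_1 + b_2A_2 + \dots + b_nA_n = 0,$$

therefore

$$b_1\frac{\partial\Delta}{\partial a_1} + b_2\frac{\partial\Delta}{\partial a_2} + \dots + b_n\frac{\partial\Delta}{\partial a_n} = \left(b \,\Big|\, \frac{\partial}{\partial a}\right)\Delta = 0, \tag{8}$$

and likewise for any other such pairs of columns, other than
the a and b column used here.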

More generally, if $x = \{x_1, \dots, x_n\}$ denote the n elements
of an arbitrary column not necessarily contained in Δ, the
operator

$$\left(x \,\Big|\, \frac{\partial}{\partial a}\right) = x_1\frac{\partial}{\partial a_1} + \dots + x_n\frac{\partial}{\partial a_n} \tag{9}$$

has the effect on Δ of substituting the column of x's for that of a's.

Similar remarks apply to the rows.

Hence the effect of altering a determinant by substituting
a new column or a new row for an original column or row is
attained by a differential operator: and this operator, as in (9),
is of the inner product type. Such a process, which is very
common both in algebraic geometry and in the theory of
invariants, is called a polarizing process.
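The polarizing process can be imitated in exact arithmetic. The sketch below relies on the fact that a determinant is linear in each single element, so that a unit increment of one element changes the determinant by exactly its partial derivative; the helper names `d_det`, `polarize` and `replace_column` are illustrative.

```python
from itertools import permutations

def det(m):
    """Determinant as a signed sum over permutations (adequate for small orders)."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        term = 1
        for i in range(n):
            for j in range(i + 1, n):
                if perm[i] > perm[j]:
                    term = -term
        for row, col in enumerate(perm):
            term *= m[row][col]
        total += term
    return total

def d_det(m, i, j):
    """Exact partial derivative of det(m) with respect to the element m[i][j]."""
    bumped = [row[:] for row in m]
    bumped[i][j] += 1          # det is linear in this element, so the step is exact
    return det(bumped) - det(m)

def polarize(m, x, col):
    """(x | d/da) applied to det(m), where a is column `col` of m."""
    return sum(x[i] * d_det(m, i, col) for i in range(len(m)))

def replace_column(m, x, col):
    n = len(m)
    return [[x[i] if j == col else m[i][j] for j in range(n)] for i in range(n)]

m = [[2, 1, 0], [1, 3, -1], [4, 0, 5]]
x = [7, -2, 3]
a = [row[0] for row in m]
b = [row[1] for row in m]

print(polarize(m, x, 0) == det(replace_column(m, x, 0)))  # substitution of x for a
print(polarize(m, a, 0) == det(m))                        # identity (2)
print(polarize(m, b, 0) == 0)                             # identity (8)
```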

Various notation has been used for this process, acting upon
a function f of variables $x_1, x_2, \dots, x_n$, such as






2. The Capelli Operators. 


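The process may be repeated, with the use of several sets
$x = \{x_1, \dots, x_n\}$, $y = \{y_1, \dots, y_n\}$, . . . to replace columns of the
determinant. Thus if Δ = (abc . . . m),

$$
\begin{aligned}
\left(x \,\Big|\, \frac{\partial}{\partial a}\right)\Delta &= (xbc \dots m), \\
\left(y \,\Big|\, \frac{\partial}{\partial b}\right)\left(x \,\Big|\, \frac{\partial}{\partial a}\right)\Delta &= (xyc \dots m),
\end{aligned} \tag{11}
$$

and so on.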

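Let us now suppose that all these sets x, y, . . . are perfectly
arbitrary but independent of a and b, when we regard all the
elements $a_i$, $b_j$, . . . as variables, so that the ordinary laws of
scalar numbers may apply to expressions involving $x_i$, $y_j$,
$\dfrac{\partial}{\partial a_i}$, $\dfrac{\partial}{\partial b_j}$, &c. It follows that in the above result

$$\left(y \,\Big|\, \frac{\partial}{\partial b}\right)\left(x \,\Big|\, \frac{\partial}{\partial a}\right) \text{ is equivalent to } \left(x \,\Big|\, \frac{\partial}{\partial a}\right)\left(y \,\Big|\, \frac{\partial}{\partial b}\right), \tag{12}$$

for the x standing to the right of $\dfrac{\partial}{\partial b}$ is unaffected by the
differentiation. This is a feature which is probably familiar to the reader
through the study of linear differential equations with constant
coefficients.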

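Next let the set x be interchanged with the set y, to give a
new identity

$$\left(x \,\Big|\, \frac{\partial}{\partial b}\right)\left(y \,\Big|\, \frac{\partial}{\partial a}\right)\Delta = (yxc \dots m). \tag{13}$$

Here on the right we may write −(xyc . . . m) by interchanging
two columns of the determinant. On subtracting
(13) from (11)

$$\begin{vmatrix} \left(x \,\big|\, \dfrac{\partial}{\partial a}\right) & \left(y \,\big|\, \dfrac{\partial}{\partial a}\right) \\[2ex] \left(x \,\big|\, \dfrac{\partial}{\partial b}\right) & \left(y \,\big|\, \dfrac{\partial}{\partial b}\right) \end{vmatrix}\,\Delta = 2(xyc \dots m), \tag{14}$$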





and by the theorem of corresponding matrices (§4, p. 79) the
left-hand side is naturally written

$$\left(xy \,\Big|\, \frac{\partial}{\partial a}\frac{\partial}{\partial b}\right)\Delta = 2!\,(xyc \dots m). \tag{15}$$

The notation must therefore be used with caution, for in the
operators the symbol $\dfrac{\partial}{\partial a}\dfrac{\partial}{\partial b}$ is not short for $\dfrac{\partial^2}{\partial a\,\partial b}$ but for various
determinantal expressions; and it alternates in a and b as it is
a matrix inner product (§5, p. 82).

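Proceeding in this way until n auxiliary sets x, y, z, . . . , t
are involved, we obtain the following identities, which for
shortness are written out for the case when n = 4,

$$
\begin{aligned}
\Delta &= (abcd) = |a_1b_2c_3d_4|, \\
\left(x \,\Big|\, \frac{\partial}{\partial a}\right)\Delta &= (xbcd), \\
\left(xy \,\Big|\, \frac{\partial}{\partial a}\frac{\partial}{\partial b}\right)\Delta &= 2!\,(xycd), \\
\left(xyz \,\Big|\, \frac{\partial}{\partial a}\frac{\partial}{\partial b}\frac{\partial}{\partial c}\right)\Delta &= 3!\,(xyzd), \\
\left(xyzt \,\Big|\, \frac{\partial}{\partial a}\frac{\partial}{\partial b}\frac{\partial}{\partial c}\frac{\partial}{\partial d}\right)\Delta &= 4!\,(xyzt).
\end{aligned} \tag{16}
$$

These operators are sometimes known as the Capelli operators,
while the last of the series introduces us to the very important
special case of such, involving the Cayley operator constructed
from independent variables:

$$\Omega = \begin{vmatrix} \dfrac{\partial}{\partial a_1} & \dfrac{\partial}{\partial a_2} & \dots & \dfrac{\partial}{\partial a_n} \\[2ex] \dfrac{\partial}{\partial b_1} & \dfrac{\partial}{\partial b_2} & \dots & \dfrac{\partial}{\partial b_n} \\ \dots & \dots & \dots & \dots \\ \dfrac{\partial}{\partial m_1} & \dfrac{\partial}{\partial m_2} & \dots & \dfrac{\partial}{\partial m_n} \end{vmatrix}.$$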



3. The Cayley Operator. 

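Theorem. The effect of the Cayley operator upon the sth
power of its determinant Δ is $s(s + 1) \dots (s + n - 1)\Delta^{s-1}$.

It will be noted that if n = 1 this reverts to the well-known
$da^s/da = sa^{s-1}$. So the theorem gives a very interesting
generalization of an elementary fact.

For clearness we consider the proof when n = 3, and

$$\Delta = \begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}, \qquad
\Omega = \begin{vmatrix} \dfrac{\partial}{\partial a_1} & \dfrac{\partial}{\partial a_2} & \dfrac{\partial}{\partial a_3} \\[2ex] \dfrac{\partial}{\partial b_1} & \dfrac{\partial}{\partial b_2} & \dfrac{\partial}{\partial b_3} \\[2ex] \dfrac{\partial}{\partial c_1} & \dfrac{\partial}{\partial c_2} & \dfrac{\partial}{\partial c_3} \end{vmatrix}.$$

Since $\partial\Delta/\partial a_i = A_i$, we have $\partial\Delta^s/\partial a_i = s\Delta^{s-1}A_i$, whence

$$\left(x_1\frac{\partial}{\partial a_1} + x_2\frac{\partial}{\partial a_2} + x_3\frac{\partial}{\partial a_3}\right)\Delta^s = s\Delta^{s-1}(x_1A_1 + x_2A_2 + x_3A_3),$$

or

$$\left(x \,\Big|\, \frac{\partial}{\partial a}\right)\Delta^s = s\Delta^{s-1}(xbc). \tag{17}$$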

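Differentiating the right side with regard to $b_i$ gives

$$s(s - 1)\Delta^{s-2}B_i\,(xbc) + s\Delta^{s-1}(cx)_{jk},$$

whence, after multiplying by an arbitrary $y_i$ and summing for
i = 1, 2, 3,

$$\left(y \,\Big|\, \frac{\partial}{\partial b}\right)\left(x \,\Big|\, \frac{\partial}{\partial a}\right)\Delta^s = s(s - 1)\Delta^{s-2}(ayc)(xbc) + s\Delta^{s-1}(xyc). \tag{18}$$

But (ayc)(xbc) − (axc)(ybc) = (abc)(xyc) = Δ(xyc) identically
(p. 42, (29)). Hence after rewriting (18) with x, y interchanged
throughout and subtracting from (18), we have

$$\left(xy \,\Big|\, \frac{\partial}{\partial a}\frac{\partial}{\partial b}\right)\Delta^s = s(s + 1)\Delta^{s-1}(xyc). \tag{19}$$

¹ Cf. Grace and Young, The Algebra of Invariants (Cambridge, 1903), p. 269.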




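Next, in the same way,

$$\left(z \,\Big|\, \frac{\partial}{\partial c}\right)\left(xy \,\Big|\, \frac{\partial}{\partial a}\frac{\partial}{\partial b}\right)\Delta^s = s(s + 1)(s - 1)\Delta^{s-2}(abz)(xyc) + s(s + 1)\Delta^{s-1}(xyz). \tag{20}$$

Interchanging x, y, z according to the scheme $\dot{x}\dot{y}, \dot{z}$ which is

$$xy,\,z; \qquad yz,\,x; \qquad zx,\,y$$

we obtain three such equations as (20): and the result of adding
them up is (p. 42, (28))

$$\left(xyz \,\Big|\, \frac{\partial}{\partial a}\frac{\partial}{\partial b}\frac{\partial}{\partial c}\right)\Delta^s = s(s + 1)(s - 1)\Delta^{s-2}\,\Delta\,(xyz) + 3s(s + 1)\Delta^{s-1}(xyz),$$

since the last term is the same for each, (xyz) = (yzx) = (zxy).
Thus

$$(xyz)\,\Omega\Delta^s = s(s + 1)(s - 1 + 3)\Delta^{s-1}(xyz)$$

or

$$\Omega\Delta^s = s(s + 1)(s + 2)\Delta^{s-1}. \tag{21}$$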

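EXAMPLES

1. Prove by this method that if Δ = (ab . . . ef . . . m) is a determinant
of order n, then

$$\left(x \,\Big|\, \frac{\partial}{\partial a}\right)\Delta^s = s\Delta^{s-1}(xbc \dots m), \qquad \left(xy \,\Big|\, \frac{\partial}{\partial a}\frac{\partial}{\partial b}\right)\Delta^s = s(s + 1)\Delta^{s-1}(xyc \dots m).$$

2. For k < n, if x . . . y and a . . . e both denote k columns, prove

$$\left(x \dots y \,\Big|\, \frac{\partial}{\partial a} \dots \frac{\partial}{\partial e}\right)\Delta^s = s(s + 1) \dots (s + k - 1)\Delta^{s-1}(x \dots y\, f \dots m).$$

[Use induction, and proceed as before.]

3. If k = n, deduce the theorem

$$\Omega\Delta^s = s(s + 1) \dots (s + n - 1)\Delta^{s-1}.$$

4. If r < s, prove

$$\Omega^r\Delta^s = \frac{s!}{(s - r)!}\cdot\frac{(s + 1)!}{(s - r + 1)!}\cdots\frac{(s + n - 1)!}{(s + n - r - 1)!}\,\Delta^{s-r}.$$

5. If r = s, this becomes

$$\Omega^s\Delta^s = \frac{s!\,(s + 1)!\,(s + 2)! \dots (s + n - 1)!}{1!\,2! \dots (n - 1)!}.$$

6. If r > s, prove $\Omega^r\Delta^s = 0$.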





7. Prove 


dai dbj 



— s{s + 1) ^ (s f ^ — 1) \fi . . . Wn I , 


where | aibj . . . ] and \ fi . • . m,i \ are complementary co-factors in A. 


4. Theorem o! Corresponding Matrices adapted to the Capelli 
Operator. 


If $x_a$ denote the operator $(x \mid \partial/\partial a)$, then the theorem of
corresponding matrices yields

$$\left(xy \,\Big|\, \frac{\partial}{\partial a}\frac{\partial}{\partial b}\right) = \begin{vmatrix} x_a & y_a \\ x_b & y_b \end{vmatrix}. \tag{22}$$

This breaks down if the variables a, b are replaced by x, y or
by functions of x and y, because the left-hand factor of each
term of the expanded operator, $x_ay_b - x_by_a$, acts on the right.
Thus we find, if the determinant is expanded by columns,

$$\begin{vmatrix} x_x & y_x \\ x_y & y_y \end{vmatrix} f = \sum (xy)_{ij}\left(\frac{\partial}{\partial x}\frac{\partial}{\partial y}\right)_{ij} f - x_xf. \tag{23}$$

The second term on the right, $-x_xf$, is due to $\dfrac{\partial}{\partial y}$ in $x_y$ acting
directly on y in $y_x$. Also, if we expand by rows, writing the
determinant as $x_xy_y - y_xx_y$ we obtain still another result. Let
us therefore agree to expand by columns in each case when
this ambiguity may arise.

Further, let the two letters of an element $x_y$ be called the
upper x and the lower y, so that the lower letter represents a
set of differential operators $\dfrac{\partial}{\partial y}$. Then we notice that when a
lower letter y in an earlier column is followed by the same y as
upper letter in a later column, a new term, as already remarked,
may arise. It is well to have names to distinguish these terms.
In an operator such as the above, terms arising from direct
differentiation upon f are called extrinsic, but terms arising within
the operator are intrinsic. In (23) above, $-x_xf$ is an intrinsic
term, while the summation Σ gives a series of extrinsic terms.




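By rearranging the terms of (23) we have the relation

$$\begin{vmatrix} x_x & y_x \\ x_y & y_y + 1 \end{vmatrix} f = \sum (xy)_{ij}\left(\frac{\partial}{\partial x}\frac{\partial}{\partial y}\right)_{ij} f = \left(xy \,\Big|\, \frac{\partial}{\partial x}\frac{\partial}{\partial y}\right)f. \tag{24}$$

This ingenious device absorbs the intrinsic term into the operator
by adding a new extrinsic term $x_xf$ through increase of the lower
right element $y_y$ by unity. It was Capelli¹ who first discovered
this law of adjustment in its generality, which can take the
elegant form for r < n sets of variables x, y, . . . , z, t,

$$\Delta \equiv \begin{vmatrix} x_x & y_x & \dots & z_x & t_x \\ x_y & y_y + 1 & \dots & z_y & t_y \\ \dots & \dots & \dots & \dots & \dots \\ x_z & y_z & \dots & z_z + r - 2 & t_z \\ x_t & y_t & \dots & z_t & t_t + r - 1 \end{vmatrix} = \left(xy \dots zt \,\Big|\, \frac{\partial}{\partial x}\frac{\partial}{\partial y} \dots \frac{\partial}{\partial z}\frac{\partial}{\partial t}\right). \tag{25}$$

Here the leading diagonal has 0, 1, 2, . . . , r − 1 added to its
respective elements, which otherwise agree with the algebraic
theorem of corresponding matrices.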

Proof. If we expand Δ by a Laplace (2 : r − 2) development,
every minor from the first two columns is of the required type,
since those involving row$_y$ obey the law shown in (24), and all
others have no intrinsic terms.

Accordingly we assume the theorem true for all minors of
the first r − 1 columns, and proceed to prove it inductively for
Δ itself by the (r − 1 : 1) development. On performing this
expansion of Δ, we have

$$\Delta = T_1t_x + T_2t_y + \dots + T_r(t_t + r - 1), \tag{26}$$

where $T_i$ is the co-factor of the last element in row$_i$. But by
hypothesis

Here the only intrinsic terms are due to the presence of t in 

¹ Math. Annalen, 29 (1887), 331-338.


xy ...z 


dy dz dt 


I)'- 





But — . Hence, by summing i 1 , 2, . . . , n, the intrinsic 
dti axi 

terms of combine into the single expression 



a d d\ 
dy dz dxJ 

= — {xy...z 


d a 

dx dy 



on shifting - through r — - 2 places. Similarly each of the co-factors 

ox 

T 2 , . . . , furnish — T,. as intrinsic term. So the sum of all 
intrinsic terms in A cancels the 1) in the last term of 

(26) which itself is free from intrinsic terms. Hence we can 
write 

A = T/ + t ; 


where the accent denotes that the operation passes over 
, , , y tf and acts only on what may follow. Collecting terms 
we now have 



0 9 0 0\ 

dx dy dz ~dw 


which proves the theorem. 


Corollary I. — If r > n, A vanishes identically. 

Corollary II. — A is unchanged by deranging x, y, . . . , z, t
similarly in both rows and columns. For this leaves H unchanged. 

Corollary III. — A is unchanged by transposition, followed by 
reversal of the integers to r — 1, r — 2, . . . , 3, 2, 1, 0 in the 
leading diagonal. 

This follows by induction proceeding from the final column 
towards the left. 


EXAMPLES 


1. Prove 


Xx-f '2 Xy 

Vx Vu 1 

2,/ 




I ^{J 

2. Prove, if w > 3, and yj is independent of x and y, 

d d d 


Xx-j-2 x,f fz 
^X r.y ,,z 


^ (xr,z 


I dx dy dZi 


)• 





3. The general Capelli operator I ^ y ^ ^ involving 

\ \ dx dy dz dt du/ 

some upper and lower letters y, t alike, and others entirely independent, is 
equal to a determinant of type (25) with 0, I, 0, 3, 0 replacing 0, 1, 2, , 
in the leading diagonal terms. 


4. If p, q satisfy all the laws of §1, p. 57, except that of commutative
multiplication which is replaced by

$$pq - qp = 1,$$

prove that

$$p^2q^2 = pq(pq + 1), \qquad p^3q^3 = pq(pq + 1)(pq + 2),$$

$$q^2p^2 = qp(qp - 1), \qquad q^3p^3 = qp(qp - 1)(qp - 2).$$

[Try first when $p = \dfrac{\partial}{\partial q}$. Next try directly by substituting pq − 1 for
qp in p(qp)q.]

5. Prove

$$p^rq^r = pq(pq + 1) \dots (pq + r - 1),$$

$$q^rp^r = qp(qp - 1) \dots (qp - r + 1).$$

6. Prove $p^2q - qp^2 = 2p$, $p^3q - qp^3 = 3p^2$, $p^rq - qp^r = rp^{r-1}$.
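The hint in Example 4 suggests a concrete model. The sketch below assumes p to be differentiation with regard to q, and q to be multiplication by q, both acting on polynomials stored as coefficient lists; the names `p`, `q`, `shifted` and `apply_product` are illustrative only.

```python
def trim(f):
    """Drop trailing zero coefficients (keeping at least one entry)."""
    f = list(f)
    while len(f) > 1 and f[-1] == 0:
        f.pop()
    return f

def p(f):
    """p = d/dq acting on f[0] + f[1] q + f[2] q^2 + ..."""
    return trim([k * f[k] for k in range(1, len(f))] or [0])

def q(f):
    """q = multiplication by q."""
    return trim([0] + list(f))

def add(f, g):
    n = max(len(f), len(g))
    f = list(f) + [0] * (n - len(f))
    g = list(g) + [0] * (n - len(g))
    return trim([x + y for x, y in zip(f, g)])

def apply_product(ops, f):
    """Apply a product of operators written left to right; the rightmost acts first."""
    for op in reversed(ops):
        f = op(f)
    return f

def shifted(ops, c):
    """The operator (product of ops) + c, e.g. pq + 1 or qp - 1."""
    return lambda f: add(apply_product(ops, f), [c * x for x in f])

f = [3, -1, 0, 2, 5]   # 3 - q + 2q^3 + 5q^4

# pq - qp = 1
print(add(apply_product([p, q], f), [-x for x in apply_product([q, p], f)]) == trim(f))
# p^2 q^2 = pq(pq + 1)
print(apply_product([p, p, q, q], f) == apply_product([p, q, shifted([p, q], 1)], f))
# q^2 p^2 = qp(qp - 1)
print(apply_product([q, q, p, p], f) == apply_product([q, p, shifted([q, p], -1)], f))
# p^2 q - q p^2 = 2p
print(add(apply_product([p, p, q], f), [-x for x in apply_product([q, p, p], f)])
      == trim([2 * x for x in p(f)]))
```

A check on one polynomial is of course no proof, but it catches sign errors in the stated relations.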


5. Connexion between Substitutional Analysis and Differentiation. 

The preceding investigations show that a close analogy exists 
between the typical process of algebra, the permutation, and 
that of analysis, differentiation. Indeed many of the properties 
of matrices, determinants, and the like, are rendered the clearer 
by bringing into play this twofold aspect of what is really one 
fundamental operation. A very simple example will suffice to
lead up to the general idea.¹ Consider the operation of $\frac{d}{dx}$ upon
$x^n$ when $n$ is a positive integer. If we write $x^n$ as a product of
$n$ factors each equal to $x$,

$$x\,x\,x\,x \ldots,$$

it is clear that we can pick out an $x$ in $n$ different ways: we can
then substitute unity for this factor in $n$ different ways. If we
do so, and add up the results we arrive at $nx^{n-1}$, namely the
result of operating with $\frac{d}{dx}$ on $x^n$. Thus

$$\frac{d}{dx}\,x^n = 1xxx\ldots + x1xx\ldots + \ldots + xxx\ldots 1 = nx^{n-1}. \qquad (27)$$

Similarly

$$y\frac{d}{dx}\,x^n = yxxx\ldots + xyxx\ldots + \ldots + xxx\ldots y = nx^{n-1}y. \qquad (28)$$

¹ Cf. MacMahon, Combinatory Analysis, I (Cambridge, 1915), p. 224.



120 DIFFERENTIATION OF A DETERMINANT [Chap. 


Here the left-hand operator is the simplest type of polar operator; 
and we see from the series to which it gives rise that it is essentially 
an operation involving permutations of substitutions. 
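The substitution picture behind (27) and (28) can be checked numerically. The following is a minimal sketch in Python, not part of the original text; the function name `derivative_by_substitution` and the sample values are illustrative assumptions. It writes $x^n$ as a list of $n$ equal factors, replaces one factor at a time by $y$ (unity for (27)), and adds the resulting products.

```python
from fractions import Fraction

def derivative_by_substitution(n, x, y=1):
    """Sum the n products obtained by replacing one factor x of x^n by y."""
    total = 0
    for position in range(n):
        factors = [x] * n          # x^n written as the product x*x*...*x
        factors[position] = y      # substitute y for the chosen factor
        product = 1
        for factor in factors:
            product *= factor
        total += product
    return total

# Hypothetical sample values, chosen only for illustration.
x = Fraction(3, 2)
n = 5
result_27 = derivative_by_substitution(n, x)               # y = 1, as in (27)
result_28 = derivative_by_substitution(n, x, Fraction(7))  # polar operator, as in (28)
```

With exact rational arithmetic the two sums agree with $nx^{n-1}$ and $nx^{n-1}y$ respectively.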

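Now the determinantal permutation

$$\dot{a}\dot{b},\ \dot{c} \equiv ab,\ c + bc,\ a + ca,\ b$$

which takes its rise in the Laplace development of a determinant,

$$\Delta = |a_1 b_2 c_3| = |a_1 b_2|\,c_3 + |b_1 c_2|\,a_3 + |c_1 a_2|\,b_3,$$

has the same general features, only complicated by the change
of sign which accompanies an interchange of letters. And if the
determinantal permutation operates on letters representing
columns of determinants, it is found in all cases to be expressible
by differential operators. For example, the process $\dot{a}\dot{b}, \dot{c}$ applied
to a product of determinants of any, the same, order, say the
fourth order,

$$(abde)\,(cfgh)$$

may be equated to

$$\tfrac{1}{2}\Big(abc \,\Big|\, \frac{\partial}{\partial x}\,\frac{\partial}{\partial y}\,\frac{\partial}{\partial z}\Big)\,(xyde)\,(zfgh).$$

For this operator is equal to

$$\Big(ab \,\Big|\, \frac{\partial}{\partial x}\frac{\partial}{\partial y}\Big)\Big(c \,\Big|\, \frac{\partial}{\partial z}\Big) + \Big(bc \,\Big|\, \frac{\partial}{\partial x}\frac{\partial}{\partial y}\Big)\Big(a \,\Big|\, \frac{\partial}{\partial z}\Big) + \Big(ca \,\Big|\, \frac{\partial}{\partial x}\frac{\partial}{\partial y}\Big)\Big(b \,\Big|\, \frac{\partial}{\partial z}\Big).$$

Also by (14),

$$\Big(ab \,\Big|\, \frac{\partial}{\partial x}\frac{\partial}{\partial y}\Big)(xyde) = 2(abde), \qquad \Big(c \,\Big|\, \frac{\partial}{\partial z}\Big)(zfgh) = (cfgh).$$

Hence the effect of the whole operation $\big(abc \,\big|\, \frac{\partial}{\partial x}\frac{\partial}{\partial y}\frac{\partial}{\partial z}\big)$ is

$$2(abde)\,(cfgh) + 2(bcde)\,(afgh) + 2(cade)\,(bfgh),$$

which gives the required result.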


which gives the required result. 

In general, the permutation operations of §11, p. 47 which 
lead to the fundamental identities can be expressed as differential 

operators. For example, the series of terms, given by 





can be generated from a single product of 
two n -rowed determinants 

(aji aja . . . (ij^ . . . yj 

by the operator 

^ ( A B 33 3\ 

^ ^ 3j*j 3ic,; 37 /j 

where each x denotes a column of the determinant, A^ denotes 
i columns < 12 , . . . , a*, and Bj, j columns, while 


K being a row of i-[- j different suffixes chosen in .j different 
ways from the integers 1, 2, . . . , w. • ‘ 

g 

Just as Af is short for the ix n matrix Uj Ug . . . a, let 

0 3 3 0 0*" ^ 

be short for • • • - > and __ for — . . . Then we have 

dxj dxi dYj dyj dtjj 

the relation 


{A,L){B^M) - , [A,Bj 


d a 

\dXi i'Yj 


{X,L){Y^M). (31) 


The proof follows the lines of the previous special case. For 
as in (29), (30) 

TXi ay)^'('^' ax)(^-’’ ar)’ 


and by (16) 




Similarly for JS, Y. This at once yields the result. 

Incidentally this affords an alternative proof of Sylvester’s 
theorem (§9, (II), p. 45) when because the matrix 

product operator then factorizes as 

. . r. V /' 3 3 \ 


{AiB^)i^ 


dXi dYjJ’ 


showing that (A^ Bj) is a factor of the series {Ai L) {Bj M). 





Once more, by the same reasoning, for several matrices 
Aj, Bj, if i j -f- Z: -f- • • • < n, we may write 


(AL) (BM) (CN) 

= (, 

ii 7!... V 


ABC 


d d d 

dx oY dz 


{XL)(YM) (ZN)... 


where the currencies of 

A X L 
B Y M 
C Z N 


are 

i i n — i 

7 ; n—j 

*k k n — k 


respectively. 


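An important particular case of the above is the following
type of identity, involving the Cayley operator $\Omega$ and a product
of factors $a_x, b_y, \ldots$ where $a_x = a_1x_1 + a_2x_2 + \ldots + a_nx_n$.

$$\left.\begin{aligned}
&\text{If } n = 2,\quad \Omega\, a_x b_y = (ab);\\
&\text{if } n = 3,\quad \Omega\, a_x b_y c_z = (abc);\\
&\text{if } n = 4,\quad \Omega\, a_x b_y c_z d_t = (abcd),\ \text{\&c.}
\end{aligned}\right\} \qquad (32)$$

Thus, if $n = 3$, $\Omega = \Sigma \pm \dfrac{\partial}{\partial x_1}\dfrac{\partial}{\partial y_2}\dfrac{\partial}{\partial z_3}$, whence $\Omega\, a_x b_y c_z = \Sigma \pm a_1 b_2 c_3 = (abc)$.
And if there were more than three such
factors, the result would contain several terms, with a determinant
like $(abc)$ appearing in each. For instance, still with
$n = 3$,

$$\Omega\, a_x b_y c_z d_x = \Sigma \pm \frac{\partial}{\partial x_1}\frac{\partial}{\partial y_2}\frac{\partial}{\partial z_3}\,(a_x b_y c_z d_x) = (abc)\,d_x + (dbc)\,a_x. \qquad (33)$$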


Evidently the process mimics the ordinary rule of differentiating 
a product (cf. (27) and (28) ). 



VII.] EXAMPLES OF THE CAYLEY OPERATOR 123 


EXAMPLES 


1. If $\Omega \equiv \Sigma \pm \dfrac{\partial}{\partial x_1}\dfrac{\partial}{\partial y_2}\dfrac{\partial}{\partial z_3}$,

$$\Omega\, a_x b_y c_z a'_x b'_y = (abc)\,a'_x b'_y + (a'bc)\,a_x b'_y + (ab'c)\,a'_x b_y + (a'b'c)\,a_x b_y.$$

2. $\Omega\, a_x^2 b_y^2 c_z = 4(abc)\,a_x b_y$.

3. $\Omega\, a_x^m b_y^n c_z^p = mnp\,(abc)\,a_x^{m-1} b_y^{n-1} c_z^{p-1}$.

4. $\Omega^r a_x^m b_y^n c_z^p = \dfrac{m!\; n!\; p!}{(m-r)!\,(n-r)!\,(p-r)!}\,(abc)^r a_x^{m-r} b_y^{n-r} c_z^{p-r}$.

5. Give the corresponding identity for a determinant $\Omega$ of the nth order.

6. If the elements of $\Delta = (abc \ldots m)$ are functions of a single variable
$t$, and an accent denotes differentiation with regard to $t$ of elements
in the column indicated, prove

$$\frac{d\Delta}{dt} = (a'bc \ldots m) + (ab'c \ldots m) + (abc' \ldots m) + \ldots + (abc \ldots m').$$

7. If ^ — rto + -r r • • • and A -- j I 

where the coefficients Uj are constants and the series is convergent, prove 
SIS =n\ |a, -1- in + l)a.jA + ^ 

8. Prove (i) = + 

(ii) SI log(l - A) = ~ (»- 1)! [(1 - A)^*-l]. 

9. ( ^ ==0. 

\dx^dy., dx.^dyj x^y^ — x.,y^ 

1 ®- 

= — #( — 5+ l)(a;iy2— 

11. Prove $\Omega\,\dfrac{1}{\Delta} = 0$ for all orders of the determinant.

12. Prove the Cayley formula (§3, p. 114) true for negative values of s, 

13. The partial differential equation 

dH dH 


is satisfied by


dx^dy^z, dx^dyy^ 

c 1 1 -b b -b^ -b . . . I 
I 2! ^ 2!3! 3!4! ^ J 


where A = ^zVv 

14. The partial differential equation in nine independent variables 

• • • > ^3 

Ei . =F, or briefly nK-=F, 

dx^oy^dy^ 



is satisfied by


: C 1 4- — - — -L 

l 3! ' 3!4! ‘ 3!4!r)! ' 


I 


For independent variables, 
A2 

nl ‘ ! (w + J ) ! 

2!A3 


= + :^; + 


3!A^ 


15. 


, 4- 

n!(w + r)!(/i 4- 2^! ’ n!(n f 1) f 2) !(ri -h 3) ! J 

Prove (a 1 1) log A = . (ab I A] k.g A --= 

\ I dx/ (xyz , . ,f) \ idx dyJ (xyz . , A) 


(ahc I — ^ ^ log A — 2! and finally 

V \dx dy dz/ ^ (xyz.,.t) ^ 

12 log A — ~ -i , Q2 log A ~ 0. 

A 

16. The solution of n linear equations for t), . . . , to 

oci^ + 2/«v) + //6) =- tti, i = 1, 2, . . . , 71 

is given by


(a|^^JlogA. vi-(«|^^)logA.&c. 


6. Jacobians. 

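The determinant $|\partial u_i/\partial x_j|$ of $n$ rows ($i$) and columns ($j$),
whose elements are the first partial differential coefficients of
$n$ functions $u_1, \ldots, u_n$ with regard to their $n$ independent
arguments $x_1, \ldots, x_n$, is called the Jacobian of the set $u$
with regard to the set $x$. It can be denoted in various ways:

$$\left|\frac{\partial u_i}{\partial x_j}\right| = \frac{\partial(u_1, u_2, \ldots, u_n)}{\partial(x_1, x_2, \ldots, x_n)} = \frac{\partial(u)}{\partial(x)}.$$

Its properties are essentially algebraic, once the fundamental
facts of partial differentiation are assumed, and in particular the
theorems: if $\phi$ is an explicit function of $r$ arguments $u_1, \ldots, u_r$,
then

$$\frac{\partial\phi}{\partial x_i} = \frac{\partial\phi}{\partial u_1}\frac{\partial u_1}{\partial x_i} + \frac{\partial\phi}{\partial u_2}\frac{\partial u_2}{\partial x_i} + \ldots + \frac{\partial\phi}{\partial u_r}\frac{\partial u_r}{\partial x_i},$$

and if $\psi(u_1, u_2, \ldots, u_r, x_1, x_2, \ldots, x_n) = 0$, then

$$\frac{\partial\psi}{\partial u_1}\frac{\partial u_1}{\partial x_i} + \ldots + \frac{\partial\psi}{\partial u_r}\frac{\partial u_r}{\partial x_i} + \frac{\partial\psi}{\partial x_i} = 0.$$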



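The chief properties of the Jacobian are contained in the
following six theorems.

I. If the u's are explicit functions of $y_1, \ldots, y_n$ which in
turn are explicit functions of the x's, then

$$\frac{\partial(u)}{\partial(x)} = \frac{\partial(u)}{\partial(y)}\,\frac{\partial(y)}{\partial(x)}.$$

For by multiplication the $(i, j)$th element in the product
determinant is

$$\frac{\partial u_i}{\partial y_1}\frac{\partial y_1}{\partial x_j} + \ldots + \frac{\partial u_i}{\partial y_n}\frac{\partial y_n}{\partial x_j} = \frac{\partial u_i}{\partial x_j}.$$

II. If the n equations $u_i = u_i(x_1, \ldots, x_n)$ can be solved for
the x's in terms of the u's, and the Jacobian $|\partial x_i/\partial u_j|$ can be
constructed, then

$$\frac{\partial(u)}{\partial(x)}\,\frac{\partial(x)}{\partial(u)} = 1.$$

For the $(i, j)$th element in the product is $\sum_k \dfrac{\partial u_i}{\partial x_k}\dfrac{\partial x_k}{\partial u_j} = \dfrac{\partial u_i}{\partial u_j}$, leading to
the unit determinant (§2, p. 32).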
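Theorem I can be illustrated numerically. The sketch below is in Python and is not taken from the text: the two maps `y_of_x` and `u_of_y` are arbitrary illustrative choices, and the Jacobians are estimated by central differences, so the comparison holds only up to a small numerical tolerance.

```python
def jacobian_det_2x2(f, point, h=1e-6):
    """Determinant |d f_i / d x_j| of a map R^2 -> R^2, by central differences."""
    x1, x2 = point
    col1 = [(a - b) / (2 * h) for a, b in zip(f(x1 + h, x2), f(x1 - h, x2))]
    col2 = [(a - b) / (2 * h) for a, b in zip(f(x1, x2 + h), f(x1, x2 - h))]
    return col1[0] * col2[1] - col1[1] * col2[0]

def y_of_x(x1, x2):
    # Illustrative inner map y = y(x); an assumption of this sketch.
    return (x1 + x2 ** 2, x1 * x2)

def u_of_y(y1, y2):
    # Illustrative outer map u = u(y); an assumption of this sketch.
    return (y1 ** 2, y1 + y2)

def u_of_x(x1, x2):
    return u_of_y(*y_of_x(x1, x2))

point = (1.5, 0.5)
lhs = jacobian_det_2x2(u_of_x, point)                    # d(u)/d(x)
rhs = (jacobian_det_2x2(u_of_y, y_of_x(*point))          # d(u)/d(y) ...
       * jacobian_det_2x2(y_of_x, point))                # ... times d(y)/d(x)
```

For these particular maps the exact value is $2(x_1 + x_2^2)(x_1 - 2x_2^2)$, which equals 3.5 at the chosen point.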


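III. If $F_i(u_1, u_2, \ldots, u_n, x_1, x_2, \ldots, x_n) = 0$, $i = 1, 2, \ldots, n$,
then

$$\frac{\partial(u)}{\partial(x)} = (-)^n\, \frac{\partial(F)}{\partial(x)} \Big/ \frac{\partial(F)}{\partial(u)}.$$

For by actual multiplication

$$\frac{\partial(u)}{\partial(x)}\,\frac{\partial(F)}{\partial(u)} = \left|\frac{\partial F_i}{\partial u_1}\frac{\partial u_1}{\partial x_j} + \ldots + \frac{\partial F_i}{\partial u_n}\frac{\partial u_n}{\partial x_j}\right| = \left|-\frac{\partial F_i}{\partial x_j}\right| = (-)^n\,\frac{\partial(F)}{\partial(x)}.$$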


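IV. Jacobi’s Lemma.¹ If $A_1, A_2, \ldots, A_n$ are the co-factors of

$$\frac{\partial u_i}{\partial x_1},\ \frac{\partial u_i}{\partial x_2},\ \ldots,\ \frac{\partial u_i}{\partial x_n}$$

in the Jacobian, then

$$\frac{\partial A_1}{\partial x_1} + \frac{\partial A_2}{\partial x_2} + \ldots + \frac{\partial A_n}{\partial x_n} = 0$$

identically.

¹ Crelle, 27 (1844), 201-209. Collected Works, 4, 317.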





For $\partial A_i/\partial x_i$ consists of a sum of $n - 1$ determinants, due to
differentiating in turn the $n - 1$ columns of the co-factor $A_i$.
We can then arrange all the $n(n - 1)$ terms arising from the
$n$ differential coefficients $\partial A_i/\partial x_i$ as a skew symmetric matrix
of order $n$, with terms arising from $A_i$ arranged in the $i$th row.
Since the matrix is skew, the sum of its terms vanishes. It is
left as an exercise for the reader to develop this proof.


7. Rank of Jacobian Matrix. 

The square matrix $[\partial u_i/\partial x_j]$ of order $n$ is the Jacobian matrix,
and its rank $r$ is the highest order of a minor determinant $\Delta_r$
which does not vanish identically. What follows now is closely
analogous to the theorem on p. 73.


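V. If a functional relation $\phi(u_1, u_2, \ldots, u_p) = 0$ connects p of
the u's, then every minor $\partial(u_1, u_2, \ldots, u_p)/\partial(x_i, \ldots, x_j)$ of order
p involving these u's vanishes identically.

For since

$$0 = \frac{\partial\phi}{\partial x_i} = \frac{\partial\phi}{\partial u_1}\frac{\partial u_1}{\partial x_i} + \ldots + \frac{\partial\phi}{\partial u_p}\frac{\partial u_p}{\partial x_i},$$

we have $p$ linear equations from which $\dfrac{\partial\phi}{\partial u_1}, \ldots, \dfrac{\partial\phi}{\partial u_p}$ may be eliminated,
so giving the desired result. Incidentally the rank $r$ is necessarily
less than $n$.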
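A small numerical illustration of Theorem V, written in Python as a sketch and not drawn from the text: the functions $u_1 = x_1 + x_2$, $u_2 = (x_1 + x_2)^2$ are an assumed example connected by the relation $u_2 - u_1^2 = 0$, so their Jacobian should vanish, whereas the independent pair $x_1 + x_2$, $x_1x_2$ has Jacobian $x_1 - x_2$. Central differences are used with exact fractions, which is exact here because every function involved has degree at most two.

```python
from fractions import Fraction

def jacobian_det(u1, u2, x1, x2, h=Fraction(1, 2)):
    """2 x 2 Jacobian by central differences (exact for polynomials of degree <= 2)."""
    def partials(u):
        return ((u(x1 + h, x2) - u(x1 - h, x2)) / (2 * h),
                (u(x1, x2 + h) - u(x1, x2 - h)) / (2 * h))
    (a, b), (c, d) = partials(u1), partials(u2)
    return a * d - b * c

x1, x2 = Fraction(2), Fraction(-3, 7)   # an arbitrary sample point

# Functionally dependent pair: u2 = u1^2.
dependent = jacobian_det(lambda x, y: x + y, lambda x, y: (x + y) ** 2, x1, x2)

# Independent pair: the Jacobian is x1 - x2.
independent = jacobian_det(lambda x, y: x + y, lambda x, y: x * y, x1, x2)
```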


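VI. If $\Delta_r = \dfrac{\partial(u_1, u_2, \ldots, u_r)}{\partial(x_1, x_2, \ldots, x_r)} \neq 0$, where r is the rank of the
Jacobian matrix, then the functions $u_1, u_2, \ldots, u_r$ are independent,
while each of the $n - r$ remaining u's is expressible in terms of
$u_1, u_2, \ldots, u_r$.

By V we already know that $u_1, \ldots, u_r$ are independent,
otherwise $\Delta_r$ vanishes. So we take these $r$ together with $x_{r+1}$,
$x_{r+2}, \ldots, x_n$ as new independent variables and express the
remaining x's as

$$x_j = \phi_j(u_1, \ldots, u_r, x_{r+1}, \ldots, x_n), \qquad j = 1, 2, \ldots, r.$$

Also let

$$u_i = f_i(x_1, x_2, \ldots, x_n), \qquad i = 1, 2, \ldots, n.$$

Then by differentiation of $u_1, u_2, \ldots, u_r, u_k$ (as functions of the new
independent variables) with regard to $x_s$ $(s > r)$,

$$0 = \frac{\partial f_i}{\partial x_1}\frac{\partial\phi_1}{\partial x_s} + \ldots + \frac{\partial f_i}{\partial x_r}\frac{\partial\phi_r}{\partial x_s} + \frac{\partial f_i}{\partial x_s}, \qquad i = 1, 2, \ldots, r,$$

$$\frac{\partial u_k}{\partial x_s} = \frac{\partial f_k}{\partial x_1}\frac{\partial\phi_1}{\partial x_s} + \ldots + \frac{\partial f_k}{\partial x_r}\frac{\partial\phi_r}{\partial x_s} + \frac{\partial f_k}{\partial x_s}, \qquad k = r + 1, r + 2, \ldots, n.$$

Eliminating $\partial\phi_1/\partial x_s, \ldots, \partial\phi_r/\partial x_s$, we obtain

$$\begin{vmatrix}
\dfrac{\partial f_1}{\partial x_1} & \ldots & \dfrac{\partial f_1}{\partial x_r} & \dfrac{\partial f_1}{\partial x_s}\\
\vdots & & \vdots & \vdots\\
\dfrac{\partial f_r}{\partial x_1} & \ldots & \dfrac{\partial f_r}{\partial x_r} & \dfrac{\partial f_r}{\partial x_s}\\
\dfrac{\partial f_k}{\partial x_1} & \ldots & \dfrac{\partial f_k}{\partial x_r} & \dfrac{\partial f_k}{\partial x_s} - \dfrac{\partial u_k}{\partial x_s}
\end{vmatrix} = 0.$$

Expanding by the last row,

$$\frac{\partial(f_1, \ldots, f_r, f_k)}{\partial(x_1, \ldots, x_r, x_s)} = \frac{\partial u_k}{\partial x_s}\,\frac{\partial(f_1, \ldots, f_r)}{\partial(x_1, \ldots, x_r)}.$$

But by hypothesis the left-hand member of this identity vanishes,
whereas $\partial(f_1, \ldots, f_r)/\partial(x_1, \ldots, x_r) \neq 0$. Hence $\partial u_k/\partial x_s$
vanishes identically, so that $u_k$ is independent of $x_s$. But $s$ can
be any of $(r + 1), \ldots, n$. Hence $u_k$ is expressible entirely as a
function of the remaining variables $u_1, \ldots, u_r$, and this
proves the theorem.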

Corollary. Provided its rank is r the matrix $[\partial u_i/\partial x_j]$ need
not necessarily be square.

In fact the above proof holds when the number of x's is $m$
where $m \geq r$.



CHAPTER VIII 


Binary Forms 

1. Binary Invariants. 

We shall now consider, as a preliminary to more complicated 
structures, a particular type of polynomial called the binary 
form: and this will be dealt with broadly in the order suggested 
by the history of algebra since the time when Lagrange and 
Gauss hinted at properties of linear transformations, finally to 
be disclosed in an epoch-making publication by Boole in Novem- 
ber, 1841. 

Let us consider this partly from a geometrical point of view.
Suppose that

$$F(x, y) \equiv ax^2 + 2hxy + by^2 + 2gx + 2fy + c, \qquad (1)$$

equated to zero, represents a conic referred to Cartesian co-ordinates
$(x, y)$, and that a change of axes is made, as indicated
by the equations,

$$T:\quad \begin{aligned} x &= l_1x' + m_1y',\\ y &= l_2x' + m_2y', \end{aligned} \qquad (2)$$

for the old co-ordinates $x, y$ in terms of the new, $x', y'$. The
origin remains fixed, and the only condition imposed on $l_1, m_1$,
$l_2, m_2$ is the inequality

$$|M| = l_1m_2 - l_2m_1 \neq 0. \qquad (3)$$

This is a homogeneous linear transformation from $x, y$ to $x', y'$.
Frobenius and other writers, with no geometrical purpose
immediately in view, call it a substitution rather than a transformation.

First we remark that any function of $x, y$ may be expressed
as a function of $x', y'$. Let us write

$$F(x, y) \equiv F'(x', y') \qquad (4)$$

to denote the two aspects of this function. Next if we collect
terms of $F$ in groups $U_r$ homogeneous in $x, y$, as indicated by
the suffix $r$, we have in this case

$$F = U_2 + U_1 + U_0 = U_2' + U_1' + U_0', \qquad (5)$$

and in particular $U_2 = U_2'$.

Thus

$$\begin{aligned}
U_2 &= ax^2 + 2hxy + by^2\\
&= a(l_1x' + m_1y')^2 + 2h(l_1x' + m_1y')(l_2x' + m_2y') + b(l_2x' + m_2y')^2\\
&= a'x'^2 + 2h'x'y' + b'y'^2 = U_2',
\end{aligned}$$

provided

$$\begin{aligned}
a' &= al_1^2 + 2hl_1l_2 + bl_2^2,\\
h' &= al_1m_1 + h(l_1m_2 + l_2m_1) + bl_2m_2,\\
b' &= am_1^2 + 2hm_1m_2 + bm_2^2.
\end{aligned} \qquad (6)$$

It is at this point that the far-reaching result disclosed by Boole
may be seen. Boole remarked that the discriminant $a'b' - h'^2$
of the quadratic $U_2'$ reproduced that of $U_2$ together with a
factor depending only on the coefficients of the transformation
$T$. Namely,

$$a'b' - h'^2 = (l_1m_2 - l_2m_1)^2\,(ab - h^2), \qquad (7)$$

as is quite easy to verify. Let this be written

$$I(a') = |M|^2\, I(a), \qquad (8)$$

where the contracted functional notation is adopted for brevity.

The factor $|M|$ is called the modulus or determinant of the
transformation $T$.

Since this function $ab - h^2$ as a whole emerges unchanged
in structure, but for a factor $|M|^2$ independent of the three arguments
$a, h, b$ of the function, it is called an invariant of the transformation.
More precisely this is called a relative, to distinguish
it from an absolute, invariant, because cases occur in which $I(a)$
and $I(a')$ are absolutely equal without the help of an extraneous
factor $|M|^2$.
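Boole's identity (7) is easily verified on a particular case. The following Python sketch is an illustration only: the helper name `transform_quadratic` and the numerical values are assumptions, while the three formulas inside it are the relations (6).

```python
def transform_quadratic(a, h, b, l1, m1, l2, m2):
    """New coefficients (a', h', b') of a x^2 + 2h xy + b y^2 under
    x = l1 x' + m1 y', y = l2 x' + m2 y', as in relations (6)."""
    a_new = a * l1 ** 2 + 2 * h * l1 * l2 + b * l2 ** 2
    h_new = a * l1 * m1 + h * (l1 * m2 + l2 * m1) + b * l2 * m2
    b_new = a * m1 ** 2 + 2 * h * m1 * m2 + b * m2 ** 2
    return a_new, h_new, b_new

# Hypothetical sample data.
a, h, b = 2, 3, 5
l1, m1, l2, m2 = 1, 2, 3, 4

modulus = l1 * m2 - l2 * m1                    # |M|
a_new, h_new, b_new = transform_quadratic(a, h, b, l1, m1, l2, m2)
old_discriminant = a * b - h ** 2
new_discriminant = a_new * b_new - h_new ** 2
```

Here $|M| = -2$, the old discriminant is 1 and the new one is 4, in agreement with (7).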

The significance of result (7) is better seen if the previous
conditions (6) are studied. Each new coefficient $a', h', b'$ is a
complicated linear function of the old coefficients $a, h, b$; and
only this particular expression $a'b' - h'^2$, or a product involving
this, turns out to have $ab - h^2$ as a factor, a property which
shall be proved later.

Boole also found other interesting results which may shortly be stated.
If $\omega$ denote the angle between the axes $Ox, Oy$, and $\omega'$ that
between $Ox', Oy'$, then the transformation $T$ may be regarded as a
means of referring the same geometrical figure to two sets of axes
$Oxy$, $Ox'y'$ at the same origin. The assumption already made that
$M$ differs from zero implies

$$\sin\omega \neq 0, \quad \sin\omega' \neq 0,$$

i.e.

$$\omega \not\equiv 0 \bmod \pi, \quad \omega' \not\equiv 0 \bmod \pi.$$

The axes $Ox, Oy$ are inclined to one another; and so are $Ox'$,
$Oy'$. Boole found the following relations:

$$|M| = \frac{\sin\omega'}{\sin\omega},$$

$$\frac{a'b' - h'^2}{\sin^2\omega'} = \frac{ab - h^2}{\sin^2\omega}, \qquad (9)$$

$$\frac{a' + b' - 2h'\cos\omega'}{\sin^2\omega'} = \frac{a + b - 2h\cos\omega}{\sin^2\omega}. \qquad (10)$$

Here again is an instance of invariants: this time the invariant
functions involve $a, b, h, \omega$, which is a more complicated set of
arguments than $a, b, h$ alone. On the other hand the invariants
are absolute, not merely relative.


2. Orthogonal Transformation and Invariants. 

If we impose the conditions

$$l_1^2 + l_2^2 = 1 = m_1^2 + m_2^2, \qquad l_1m_1 + l_2m_2 = 0 \qquad (11)$$

upon the coefficients of the binary linear transformation $T$ we
call it now an orthogonal transformation. Geometrically these
conditions show that if the lines $Ox, Oy$ are at right angles, so
also are $Ox', Oy'$: and conversely.

It will be seen that the values of the angles $\omega, \omega'$ are now

$$\omega = \pm\omega' = \pm\frac{\pi}{2}; \qquad (12)$$

for if we write $l_1 = \cos\theta$, $l_2 = \sin\theta$, $m_1 = \cos\phi$, $m_2 = \sin\phi$ to
satisfy the first condition, then

$$0 = l_1m_1 + l_2m_2 = \cos(\theta - \phi).$$

Hence $\theta$ and $\phi$ differ by an odd number of right angles. This is
covered by two alternatives:

$$\left.\begin{aligned}
&\text{Either}\quad l_1 = m_2 = \cos\theta, \quad l_2 = -m_1 = \sin\theta,\\
&\text{or}\quad\quad\ \ l_1 = -m_2 = \cos\theta, \quad l_2 = m_1 = \sin\theta.
\end{aligned}\right\} \qquad (13)$$

In the first case the axes $Oxy$ are obtained from $Ox'y'$ by a
clockwise rotation through an angle $\theta$: in the second they
require, besides, turning over, to bring $Ox$ into coincidence with
$Ox'$ and simultaneously $Oy$ with $Oy'$.

This set of congruent right-angled triangles illustrates the 
point. The origins are separated for clearness. All such coplanar 
triangles fall into two classes according as whether, by a rigid 
displacement in their own plane, they can be superimposed or 
not. In the figure, I and II are in one class, III is in the other. 

Algebraically the classes are included in one statement, easily
verified, that

$$|M|^2 = \begin{vmatrix} l_1 & m_1\\ l_2 & m_2 \end{vmatrix}^2 = 1; \qquad (14)$$

and they are distinguished by extracting the square root. Thus

if $|M| = 1$, $Oxy$, $Ox'y'$ belong to the same class;
if $|M| = -1$, $Oxy$, $Ox'y'$ belong to different classes.

We observe that for both classes of orthogonal transformation
there are two absolute invariants derived from the results (9)
and (10), namely,

$$a' + b' = a + b, \qquad a'b' - h'^2 = ab - h^2.$$
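Both classes of orthogonal transformation can be tried on a numerical case. The sketch below is in Python with exact fractions; the rotation and reflection built on the ratios 3/5 and 4/5, and the helper name `transform_quadratic`, are illustrative assumptions rather than anything taken from the text. It checks that $a + b$ and $ab - h^2$ are unchanged in each class.

```python
from fractions import Fraction

def transform_quadratic(a, h, b, l1, m1, l2, m2):
    """New coefficients of a x^2 + 2h xy + b y^2 under
    x = l1 x' + m1 y', y = l2 x' + m2 y'."""
    a_new = a * l1 ** 2 + 2 * h * l1 * l2 + b * l2 ** 2
    h_new = a * l1 * m1 + h * (l1 * m2 + l2 * m1) + b * l2 * m2
    b_new = a * m1 ** 2 + 2 * h * m1 * m2 + b * m2 ** 2
    return a_new, h_new, b_new

c, s = Fraction(3, 5), Fraction(4, 5)          # cos and sin of a sample angle
rotation = (c, -s, s, c)                       # (l1, m1, l2, m2), modulus +1
reflection = (c, s, s, -c)                     # (l1, m1, l2, m2), modulus -1

a, h, b = Fraction(2), Fraction(3), Fraction(5)
results = {}
for name, (l1, m1, l2, m2) in {"rotation": rotation, "reflection": reflection}.items():
    a_new, h_new, b_new = transform_quadratic(a, h, b, l1, m1, l2, m2)
    results[name] = {
        "modulus": l1 * m2 - l2 * m1,
        "trace": a_new + b_new,
        "discriminant": a_new * b_new - h_new ** 2,
    }
```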


The importance of the above simple and well-known division
into classes lies in the fact that it typifies the general case of
orthogonal transformations for a set of n variables, not merely 
two. If we call the classes right and left handed, for distinction, 
a frame of right-handed axes can be brought to coincide with 
another right-handed frame by a rigid displacement in its own 
space; but an extra dimension of space is needed for a rigid 
displacement to bring it into coincidence with a left-handed 
frame. We shall return to this in Chapter IX. 

It might be supposed that the absolute invariants are more 
important than the relative, but such is not the case, as the sequel 
will show. Speaking generally, the absolute invariant corresponds 
to a part of geometry which is merely a special case of something 
more general. The relative invariant is the important one. 

All these expressions have a geometrical significance. For
instance, $U_2 = 0$ represents two straight lines through the origin.
If $ab - h^2 = 0$ the lines coincide, and if this is so, no mere change
of axes will separate them; consequently $a'b' - h'^2 = 0$. But
we can go a little further, supposing two real frames of axes,
real co-ordinates, and real coefficients, so that $|M|^2 > 0$. Thus
$ab - h^2$, $a'b' - h'^2$ are both positive, or both zero, or both
negative. This is illustrated by the conic given by $F = 0$,
the cases of ellipse, parabola, and hyperbola answering to this
threefold classification.

Also if $a + b$ vanishes and the axes are at right angles then
$U_2 = 0$ represents a pair of lines at right angles. A change to
other rectangular axes at the same origin leaves the property
unchanged. This explains why $a + b$ is an orthogonal invariant.

3. Development of the Invariant Theory. 

The discovery made by Boole in 1841 was soon reinforced 
by an almost accidental observation by Eisenstein of an invariant 
belonging to a binary quartic. At once this attracted the 
attention of Cayley, Sylvester, and Salmon. Four years later 
Cayley put the subject in a more important light by asking two 
significant questions: (i) whether these ideas could be extended
to binary forms $U_3, U_4, \ldots, U_n$ of all orders, and (ii) how far
it was possible to discover all such invariantive functions?

To these ends he invented a device which he called hyperdeterminants,
not unlike the device of denoting chemical substances
and reactions by symbols and equations. He exhibited
the properties and behaviour of hyperdeterminants, making a
practical working tool of them. Out of this calculus modern
algebra may be said to have sprung.¹

In answer to Cayley’s first question we have the systematic
development of binary forms. The functions $U_2, U_3, U_4, \ldots, U_n$
already introduced are called the binary quadratic, cubic, quartic
(or biquadratic), . . . , n-ic. We shall find it useful to call the
rational integral function of order $n$ in its arguments a polynomial.
A homogeneous polynomial is a form or quantic.

The order n is the highest degree in which the arguments 
occur in a term of the polynomial. 

4. The Binary Form or Quantic. 

We write the binary n-ic, $U_n$, as

$$U_n = a_0x^n + na_1x^{n-1}y + \binom{n}{2}a_2x^{n-2}y^2 + \ldots + a_ny^n, \qquad (16)$$

which Cayley shortened to

$$(a_0, a_1, \ldots, a_n \between x, y)^n. \qquad (17)$$

The binomial coefficients do not make this any less general than
the corresponding form

$$p_0x^n + p_1x^{n-1}y + \ldots + p_ny^n, \qquad (18)$$

which is sometimes used. They possess several clear advantages,
especially when what are called polar forms are used, as we shall
see later on.

We assume the theorem that the equation $U_n = 0$ has a root,
and consequently, by repeating the argument, that $U_n$ itself
has $n$ linear factors. Namely

$$U_n = p_0(x - \alpha y)(x - \beta y) \ldots (x - \lambda y). \qquad (19)$$

The set of $n$ quantities $\alpha, \beta, \ldots, \lambda$
are the roots of the n-ic $U_n = 0$.

¹ See an enthusiastic remark to the British Association (1869) by Sylvester,
recorded in Collected Works, 2, p. 656.



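On multiplying out and comparing coefficients in (18) and
(19) we obtain the relations

$$\left.\begin{aligned}
\Sigma\alpha &= \alpha + \beta + \ldots + \lambda = -p_1/p_0,\\
\Sigma\alpha\beta &= \alpha\beta + \alpha\gamma + \ldots = p_2/p_0,\\
\Sigma\alpha\beta\gamma &= \alpha\beta\gamma + \alpha\beta\delta + \ldots = -p_3/p_0,\\
&\ \ \vdots\\
\alpha\beta\gamma \ldots \lambda &= (-)^n p_n/p_0.
\end{aligned}\right\} \qquad (20)$$

Here $\Sigma\alpha\beta \ldots \kappa$ with $r$ factors has $\binom{n}{r}$ terms, found by all
the combinations of $r$ different letters chosen from $\alpha, \beta, \ldots, \lambda$.
These are the elementary symmetric functions of the roots.
They are called symmetric because any derangement of the
letters $\alpha, \beta, \ldots, \lambda$ makes no difference to the functions.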
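Relations (20) can be confirmed on a numerical case. The following Python sketch is illustrative only (the helper names and the sample roots are assumptions): it multiplies out the factorized form (19) and compares each ratio $p_r/p_0$ with the signed elementary symmetric function of the roots.

```python
from fractions import Fraction
from itertools import combinations

def product(values):
    result = Fraction(1)
    for value in values:
        result *= value
    return result

def expand_linear_factors(p0, roots):
    """Coefficients p_0, p_1, ..., p_n of p_0 (x - alpha y)(x - beta y)...,
    listed against x^n, x^(n-1) y, ..., y^n."""
    coeffs = [Fraction(p0)]
    for root in roots:
        padded = coeffs + [Fraction(0)]        # multiplication by x
        shifted = [Fraction(0)] + coeffs       # multiplication by y
        coeffs = [c - root * prev for c, prev in zip(padded, shifted)]
    return coeffs

def elementary_symmetric(roots, r):
    """Sum of all products of r different roots."""
    return sum((product(combo) for combo in combinations(roots, r)), Fraction(0))

roots = [Fraction(1), Fraction(-2), Fraction(1, 3), Fraction(5)]   # sample roots
p = expand_linear_factors(3, roots)
```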

Manifestly any product $\left(\dfrac{p_1}{p_0}\right)^i \left(\dfrac{p_2}{p_0}\right)^j \ldots \left(\dfrac{p_n}{p_0}\right)^l$ and therefore any
polynomial in the $n$ ratios $p_r : p_0$, is a symmetrical polynomial
in the $n$ roots $\alpha, \ldots, \lambda$, as direct substitution would show.
It can be proved conversely that any symmetric polynomial in
the roots is a polynomial in the $n$ ratios $p_r : p_0$. The emphasis
here is on the word polynomial, so that the functions considered
are rational and integral, although if $n > 1$ each root $\alpha, \beta, \ldots$
separately is a very complicated irrational function of the
coefficients $p_r$.


5. Gradient. Degree and Weight. 

A polynomial function $\phi(\alpha)$ which is symmetric and homogeneous
in the roots is a sum of such terms as

$$k\left(\frac{p_1}{p_0}\right)^i \left(\frac{p_2}{p_0}\right)^j \ldots \left(\frac{p_n}{p_0}\right)^l, \qquad (21)$$

where $k$ is a numerical coefficient. The factor $p_r/p_0$ is of degree
$r$ in the roots $\alpha$. Hence for homogeneity in the roots, the
expression

$$w = i + 2j + \ldots + nl \qquad (22)$$

is constant for each such term. This number $w$ is called the
weight of the term. It is given by counting the total of suffixes
of the p's in the term. Also it gives the degree of $\phi(\alpha)$ in the
set $\alpha, \beta, \ldots, \lambda$, as relations (20) at once indicate.



After multiplying throughout by a suitable power $p_0^{m}$, we can write of such
a form,

$$p_0^{m}\,\phi(\alpha, \ldots, \lambda) = \psi(p_0, p_1, \ldots, p_n), \qquad (23)$$

where both $\phi$ and $\psi$ are homogeneous polynomials in their arguments.
For $p_0$ no longer occurs in the denominator of any term.

Such a function $\psi$ is isobaric, i.e. of the same weight $w$ for
each term. It is also homogeneous in the set $p$, since multiplying
each of $p_0, p_1, \ldots, p_n$ by the same quantity $t$ leaves $U = 0$,
and therefore the roots $\alpha, \ldots, \lambda$, unaltered. It is sometimes
known as a gradient.

6. The Induced Linear Transformation of the Binary n-ic.

A binary form $U = (a_0, a_1, \ldots, a_n \between x, y)^n$ contains two
sets, the set of coefficients

$$a = (a_0, a_1, \ldots, a_n) \qquad (24)$$

and the set of variables $x, y$. It may seem a trivial remark, but
it is one with far-reaching consequences, that a form is linear
in its set of coefficients.

The transformation $T$ (2) from $x, y$ to $x', y'$ is conveniently
symbolized by an arrow. We write

$$T : x \to x', \qquad (25)$$

where $x$ and $x'$ do duty for $x, y$ and $x', y'$ respectively. For
this reason it is preferable as a rule to use $x_1, x_2$ rather than
$x, y$ for two homogeneous variables.

By solving equations (2) we obtain the inverse transformation,
written

$$T^{-1} : x' \to x. \qquad (26)$$

Provided the determinant | M | of the transformation is not zero, 
this can always be done, even in the case of more than two 
variables. 

The first important result of this theory is that a form of 
order n remains a form of order n after linear transformation of its 
variables x, y. 

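Thus, substituting for $x, y$ in terms of $x', y'$ we write

$$U_n = U(x, y) = U'(x', y'), \qquad (27)$$

so that

$$a_0x^n + \ldots + a_ny^n = a_0'x'^n + \ldots + a_n'y'^n. \qquad (28)$$

This introduces a new coefficient set

$$a_0', a_1', \ldots, a_n', \qquad (29)$$

such that each $a_r'$ is a linear homogeneous function of the original
set of coefficients $a$ as in the particular case already worked
out for the quadratic (6). The actual values of $a_0', a_1', \ldots, a_n'$
are best determined by Taylor’s theorem. Thus

$$U(x, y) = U(l_1x' + m_1y',\ l_2x' + m_2y'). \qquad (30)$$

But $U$ is homogeneous and of order $n$ in $x, y$. So if $y' = tx'$,
we have

$$U(x, y) = x'^n\, U(l_1 + m_1t,\ l_2 + m_2t). \qquad (31)$$

Expanding this as a function of two variables $l_1, l_2$ by Taylor’s
theorem, we get

$$x'^n\left\{U + t\left(m\frac{\partial}{\partial l}\right)U + \frac{t^2}{2!}\left(m\frac{\partial}{\partial l}\right)^2 U + \ldots + \frac{t^n}{n!}\left(m\frac{\partial}{\partial l}\right)^n U\right\},$$

in which $m_1, m_2$ are treated as constants, and $U$ denotes $U(l_1, l_2)$.

Again

$$U'(x', y') = a_0'x'^n + \ldots + \binom{n}{r}a_r'x'^{n-r}y'^r + \ldots + a_n'y'^n = x'^n\left\{a_0' + na_1't + \ldots + \binom{n}{r}a_r't^r + \ldots + a_n't^n\right\}.$$

Comparing coefficients of $t^r$ in these two equivalent expansions
we have the following result

$$\left.\begin{aligned}
a_0' &= U(l_1, l_2) = (a_0, a_1, \ldots, a_n \between l_1, l_2)^n,\\
a_1' &= \frac{1}{n}\left(m\frac{\partial}{\partial l}\right)U,\\
&\ \ \vdots\\
a_r' &= \frac{(n-r)!}{n!}\left(m\frac{\partial}{\partial l}\right)^r U,\\
&\ \ \vdots\\
a_n' &= \frac{1}{n!}\left(m\frac{\partial}{\partial l}\right)^n U,
\end{aligned}\right\} \qquad (32)$$

where $\left(m\dfrac{\partial}{\partial l}\right)$ denotes $m_1\dfrac{\partial}{\partial l_1} + m_2\dfrac{\partial}{\partial l_2}$.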



As an illustration of this general set the reader is advised
to write down the set of coefficients $a_0', a_1', a_2', a_3'$ of a cubic
after transformation $T$. It is important to appreciate that the
relations giving $a_0', a_1', \ldots$ in terms of the old coefficients
$a_0, a_1, \ldots$ are linear in these. We typify this by writing

: a' -> a, 

and conversely ^ , 

T^ :a 

to denote the inverse transformation. This is a special case of 
the linear transformation of a set of n~\~ \ quantities. Since 
the coefficients of the a’s are functions of l^y I 2 , and are 
thus completely determined by the coefficients of the original 
transformation 

T : x — > a;' 


we call or an induced linear transformation.

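Evidently there will be a close connexion between the determinant
$|M| = \begin{vmatrix} l_1 & m_1\\ l_2 & m_2 \end{vmatrix}$ of $T$ and the determinant $\Delta$ of the induced
transformation, that is, of the linear relations giving the $a'$ in terms
of the $a$; in fact $\Delta$ is $|M|$ raised to the power $\tfrac{1}{2}n(n + 1)$.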
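The power $\tfrac{1}{2}n(n+1)$ can be checked for small orders. The Python sketch below is an illustration under stated assumptions: the helper names are invented, the matrix is taken to be the one expressing $a_0', \ldots, a_n'$ linearly in $a_0, \ldots, a_n$ (with the binomial coefficients of (16) included), and it is obtained by expanding each term in powers of $t = y'/x'$ rather than by the Taylor operators.

```python
from fractions import Fraction
from math import comb

def poly_mul(p, q):
    """Multiply two polynomials in t given as coefficient lists (constant term first)."""
    out = [0] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out

def poly_pow(p, k):
    result = [1]
    for _ in range(k):
        result = poly_mul(result, p)
    return result

def induced_matrix(n, l1, m1, l2, m2):
    """Matrix expressing a_0', ..., a_n' linearly in a_0, ..., a_n."""
    rows = [[Fraction(0)] * (n + 1) for _ in range(n + 1)]
    for r in range(n + 1):
        # Image of x^(n-r) y^r, divided by x'^n, as a polynomial in t.
        image = poly_mul(poly_pow([l1, m1], n - r), poly_pow([l2, m2], r))
        for s in range(n + 1):
            rows[s][r] = Fraction(comb(n, r) * image[s], comb(n, s))
    return rows

def determinant(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    a = [row[:] for row in matrix]
    size, det = len(a), Fraction(1)
    for col in range(size):
        pivot = next((r for r in range(col, size) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det
        det *= a[col][col]
        for r in range(col + 1, size):
            factor = a[r][col] / a[col][col]
            for c in range(col, size):
                a[r][c] -= factor * a[col][c]
    return det

l1, m1, l2, m2 = 1, 2, 3, 4                    # hypothetical sample transformation
modulus = l1 * m2 - l2 * m1
quadratic_matrix = induced_matrix(2, l1, m1, l2, m2)
```

For the quadratic the rows of `quadratic_matrix` reproduce the coefficients in (6), and its determinant is the cube of the modulus.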


EXAMPLES 
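1. For the quadratic, prove

$$\Delta = \begin{vmatrix}
l_1^2 & 2l_1l_2 & l_2^2\\
l_1m_1 & l_1m_2 + l_2m_1 & l_2m_2\\
m_1^2 & 2m_1m_2 & m_2^2
\end{vmatrix} = (l_1m_2 - l_2m_1)^3.$$

2. For the cubic, prove

$$\Delta = \begin{vmatrix}
l_1^3 & 3l_1^2l_2 & 3l_1l_2^2 & l_2^3\\
l_1^2m_1 & l_1^2m_2 + 2l_1l_2m_1 & l_2^2m_1 + 2l_1l_2m_2 & l_2^2m_2\\
l_1m_1^2 & l_2m_1^2 + 2l_1m_1m_2 & l_1m_2^2 + 2l_2m_1m_2 & l_2m_2^2\\
m_1^3 & 3m_1^2m_2 & 3m_1m_2^2 & m_2^3
\end{vmatrix} = (l_1m_2 - l_2m_1)^6.$$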


Use colj -* coL + i-g ^c)l3 - 
I’l H 

3. Generalize the result. 


V C0I4, C0I2 — 2 1* C0I3 col,, &c. 


7. Polar Forms. 

In this last set (32) we have an example of the very important
process known as polarization. It will be seen that the first
coefficient $a_0'$ is the binary n-ic

$$U(l_1, l_2) = (a_0, \ldots, a_n \between l_1, l_2)^n,$$

with $l_1, l_2$ substituted for $x, y$. The other coefficients $a_1', a_2', \ldots, a_n'$
are the first, second, . . . , nth polars respectively of $U$ with
respect to $m_1, m_2$. These equations serve to define such polars.
In particular the last $a_n'$ is the n-ic in $m_1, m_2$,

$$(a_0, a_1, \ldots, a_n \between m_1, m_2)^n.$$

All intermediate coefficients $a_r'$ $(0 < r < n)$ are examples of
double binary forms: they possess double orders. Thus $a_r'$ is of
orders $(n - r, r)$ in the sets $l_1, l_2$ and $m_1, m_2$ respectively.

EXAMPLES

1. Write down the first and second polars of the quadratic
$a_0x_1^2 + 2a_1x_1x_2 + a_2x_2^2$ with regard to the set $y_1, y_2$.

Ans. $a_0x_1y_1 + a_1(x_1y_2 + x_2y_1) + a_2x_2y_2$ and $a_0y_1^2 + 2a_1y_1y_2 + a_2y_2^2$.

2. Form the first and second polars of the cubic $a_0x_1^3 + 3a_1x_1^2x_2 +
3a_2x_1x_2^2 + a_3x_2^3$ with regard to the set $y_1, y_2$.

3. Find the rth polar of the binomial n-ic $ax_1^n + bx_2^n$ with regard
to $y_1, y_2$.

Ans. $ax_1^{n-r}y_1^r + bx_2^{n-r}y_2^r$.

8. Formal Definition of Invariant. 

If a binary form f be changed by a linear transformation T into
a new form f', and a function I of the coefficients of f' be equal to
the same function of the coefficients of f multiplied by a factor
depending solely on the transformation, then I is called an invariant
of the binary form f. The form f is called the ground form.

Let us write

$$I \equiv I(a) = I(a_0, a_1, a_2, \ldots, a_n)$$

to denote an invariant of the binary n-ic, whose coefficient set
is

$$a = (a_0, a_1, a_2, \ldots, a_n).$$

Further, let $T : x \to x'$ induce the transformation $a \to a'$,
so that $I(a')$ would mean the same function of the $n + 1$ arguments

$$a_0', a_1', \ldots, a_n'.$$

Then if

$$I(a') = \phi(l_1, m_1, l_2, m_2)\, I(a),$$

where $\phi$ depends solely on the four quantities $l_1, m_1, l_2, m_2$, and
not on $a$ or $x$, $I(a)$ is an invariant.

This definition follows from Boole’s discovery. 


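Examples. $\Delta = (a_0a_3 - a_1a_2)^2 - 4(a_0a_2 - a_1^2)(a_1a_3 - a_2^2)$ is an invariant
of the binary cubic $(a_0, a_1, a_2, a_3 \between x, y)^3$.

For the quartic $(a_0, a_1, a_2, a_3, a_4 \between x, y)^4$ invariants are

$$I = a_0a_4 - 4a_1a_3 + 3a_2^2, \qquad J = \begin{vmatrix} a_0 & a_1 & a_2\\ a_1 & a_2 & a_3\\ a_2 & a_3 & a_4 \end{vmatrix}.$$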
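These three examples can be tested against the definition on numerical data. The sketch below, in Python, is illustrative and rests on assumptions: the helper names are invented, the coefficients and transformation are arbitrary, and the factor is taken to be a power of the modulus $l_1m_2 - l_2m_1$ (the sixth power for the cubic discriminant and for $J$, the fourth for $I$), which the code then verifies for the sample data.

```python
from fractions import Fraction
from math import comb

def poly_mul(p, q):
    out = [0] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out

def poly_pow(p, k):
    result = [1]
    for _ in range(k):
        result = poly_mul(result, p)
    return result

def transform(coeffs, l1, m1, l2, m2):
    """Coefficients a_r' of (a_0, ..., a_n)(x, y)^n after
    x = l1 x' + m1 y', y = l2 x' + m2 y' (binomial coefficients included)."""
    n = len(coeffs) - 1
    total = [0] * (n + 1)
    for r, a_r in enumerate(coeffs):
        image = poly_mul(poly_pow([l1, m1], n - r), poly_pow([l2, m2], r))
        for s in range(n + 1):
            total[s] += comb(n, r) * a_r * image[s]
    return [Fraction(total[s], comb(n, s)) for s in range(n + 1)]

def cubic_discriminant(a):
    a0, a1, a2, a3 = a
    return (a0 * a3 - a1 * a2) ** 2 - 4 * (a0 * a2 - a1 ** 2) * (a1 * a3 - a2 ** 2)

def quartic_i(a):
    a0, a1, a2, a3, a4 = a
    return a0 * a4 - 4 * a1 * a3 + 3 * a2 ** 2

def quartic_j(a):
    a0, a1, a2, a3, a4 = a
    return (a0 * (a2 * a4 - a3 * a3)
            - a1 * (a1 * a4 - a3 * a2)
            + a2 * (a1 * a3 - a2 * a2))

l1, m1, l2, m2 = 1, 2, 3, 4                    # hypothetical sample transformation
modulus = l1 * m2 - l2 * m1
cubic = [1, 2, -1, 3]                          # hypothetical coefficients a_0..a_3
quartic = [2, -1, 0, 3, 1]                     # hypothetical coefficients a_0..a_4
cubic_new = transform(cubic, l1, m1, l2, m2)
quartic_new = transform(quartic, l1, m1, l2, m2)
```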


9. Simultaneous Invariants. 

Let us reconsider our quadratic

$$U = ax^2 + 2hxy + by^2,$$

to which we adjoin a second,

$$V = Ax^2 + 2Hxy + By^2.$$

These in turn lead to a singly infinite system of quadratics typified
by

$$U + \lambda V = (a + \lambda A)x^2 + 2(h + \lambda H)xy + (b + \lambda B)y^2.$$

Now consider the discriminant of $U + \lambda V$,

$$\begin{vmatrix} a + \lambda A & h + \lambda H\\ h + \lambda H & b + \lambda B \end{vmatrix}.$$

It can be expanded in powers of $\lambda$ and written

$$(ab - h^2) + \lambda(aB + bA - 2hH) + \lambda^2(AB - H^2).$$

But if a linear transformation $T$ changes $x$ to $x'$, $a$ to $a'$, $A$ to $A'$,
we can write

$$U' + \lambda V' = U + \lambda V.$$

In particular

$$a' + \lambda A', \quad h' + \lambda H', \quad b' + \lambda B'$$

are the new coefficients of the quadratic. Hence, by (7), we have
identically

$$\begin{vmatrix} a' + \lambda A' & h' + \lambda H'\\ h' + \lambda H' & b' + \lambda B' \end{vmatrix} = |M|^2 \begin{vmatrix} a + \lambda A & h + \lambda H\\ h + \lambda H & b + \lambda B \end{vmatrix}.$$

This is true for all values of $a, h, b, A, H, B, \lambda$ and therefore we
can equate coefficients of $\lambda$ on each side. Thus

$$\begin{aligned}
a'b' - h'^2 &= |M|^2\,(ab - h^2),\\
a'B' + b'A' - 2h'H' &= |M|^2\,(aB + bA - 2hH),\\
A'B' - H'^2 &= |M|^2\,(AB - H^2).
\end{aligned}$$

The first and third statements here tell us nothing new, but
the second gives important information: it satisfies the characteristic
invariant condition although it involves double as many
coefficients $a, h, b, A, H, B$ as the original quadratic. It introduces
us to the new idea of a simultaneous invariant.

Definition of Simultaneous Invariant. If (a), (b), ... denote
the sets of coefficients $a_0, a_1, \ldots, b_0, b_1, \ldots$ of different quantics,
then I(a, b, . . .) is a simultaneous invariant of these quantics,
provided

$$I(a', b', \ldots) = \phi(l_1, l_2, m_1, m_2)\, I(a, b, \ldots)$$

identically for all values of the sets (a), (b), . . . .
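The simultaneous invariant $aB + bA - 2hH$ of two quadratics can be checked on numbers. The Python sketch below is illustrative: `transform_quadratic` is an assumed helper encoding relations (6), and the coefficients and transformation are arbitrary sample values.

```python
def transform_quadratic(a, h, b, l1, m1, l2, m2):
    """New coefficients of a x^2 + 2h xy + b y^2 under
    x = l1 x' + m1 y', y = l2 x' + m2 y'."""
    a_new = a * l1 ** 2 + 2 * h * l1 * l2 + b * l2 ** 2
    h_new = a * l1 * m1 + h * (l1 * m2 + l2 * m1) + b * l2 * m2
    b_new = a * m1 ** 2 + 2 * h * m1 * m2 + b * m2 ** 2
    return a_new, h_new, b_new

def joint_invariant(first, second):
    """aB + bA - 2hH for the coefficient sets (a, h, b) and (A, H, B)."""
    a, h, b = first
    A, H, B = second
    return a * B + b * A - 2 * h * H

transformation = (1, 2, 3, 4)                  # hypothetical (l1, m1, l2, m2)
modulus = transformation[0] * transformation[3] - transformation[2] * transformation[1]

u_coeffs = (2, 3, 5)                           # hypothetical (a, h, b)
v_coeffs = (1, -1, 4)                          # hypothetical (A, H, B)
u_new = transform_quadratic(*u_coeffs, *transformation)
v_new = transform_quadratic(*v_coeffs, *transformation)

before = joint_invariant(u_coeffs, v_coeffs)
after = joint_invariant(u_new, v_new)
```

For these values the invariant is 19 before and 76 after, the ratio being $|M|^2 = 4$.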


10. The Aronhold Operator. 

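The above quadratic example leads to a general theorem due
to Clebsch. Consider two p-ics

$$U = (a_0, a_1, \ldots, a_p \between x, y)^p, \qquad V = (b_0, b_1, \ldots, b_p \between x, y)^p,$$

and the pencil of p-ics given by

$$U + \lambda V = (a_0 + \lambda b_0)x^p + \ldots + \binom{p}{r}(a_r + \lambda b_r)x^{p-r}y^r + \ldots + (a_p + \lambda b_p)y^p. \qquad (34)$$

Here is an example of the addition theorem of linear sets. In
the matrix notation we could write the coefficient set of $U + \lambda V$

$$[a + \lambda b] = [a] + \lambda[b]. \qquad (35)$$

Let $T : x \to x'$ be a linear transformation changing $U$ to
$U'$, $V$ to $V'$, and therefore giving $a_i'$ linearly in terms of set $[a]$,
$b_i'$ linearly in terms of set $[b]$. Hence

$$[a' + \lambda b'] = [a'] + \lambda[b']. \qquad (36)$$

Now suppose $I(a_0, a_1, \ldots, a_p)$, written $I(a)$, to be an invariant
of $U$, so that

$$I(a') = \phi \cdot I(a):$$

then

$$I(a' + \lambda b') = \phi \cdot I(a + \lambda b)$$

identically for all values of $\lambda$. Expand both sides by Taylor’s
theorem and equate powers of $\lambda$. The coefficient of $\lambda$ on each
side gives

$$\left(b_0'\frac{\partial}{\partial a_0'} + b_1'\frac{\partial}{\partial a_1'} + \ldots + b_p'\frac{\partial}{\partial a_p'}\right)I(a_0', \ldots, a_p') = \phi\left(b_0\frac{\partial}{\partial a_0} + b_1\frac{\partial}{\partial a_1} + \ldots + b_p\frac{\partial}{\partial a_p}\right)I(a_0, \ldots, a_p). \qquad (37)$$

This is usually written in the contracted notation

$$\left(b'\frac{\partial}{\partial a'}\right)I(a') = \phi\left(b\frac{\partial}{\partial a}\right)I(a). \qquad (38)$$

Likewise the coefficient of $\lambda^2$ gives

$$\left(b'\frac{\partial}{\partial a'}\right)^2 I(a') = \phi\left(b\frac{\partial}{\partial a}\right)^2 I(a), \qquad (39)$$

where the arguments $b_0, b_1, \ldots, b_p$ are independent of $a_0, a_1, \ldots, a_p$,
and so must not be differentiated in the course of the work.
But these last results yield functions of both sets $[a]$, $[b]$ which
satisfy the invariant condition.

We conclude that the operator

$$\left(b\frac{\partial}{\partial a}\right) = \sum_{i=0}^{p} b_i\frac{\partial}{\partial a_i} \qquad (40)$$

applied to an invariant involving $a_0, a_1, \ldots, a_p$ produces an
invariant. For this reason it is called an invariant process.
In particular from a rational integral homogeneous invariant of
degree $q$ in the set $a_0, a_1, \ldots, a_p$, it produces $q - 1$ simultaneous
invariants involving both sets $[a]$ and $[b]$.

The name Aronhold operator is sometimes given to $\left(b\dfrac{\partial}{\partial a}\right)$,
after one of the founders of the theory.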
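The action of the Aronhold operator can be imitated symbolically with a very small polynomial representation. In the Python sketch below, which is illustrative and not from the text, a polynomial is a dictionary from exponent tuples to coefficients; the variable order $(a, h, b, A, H, B)$ and the function name `aronhold` are assumptions. Applied once to $ab - h^2$ the operator should give $aB + bA - 2hH$, and applied twice it should give $2(AB - H^2)$.

```python
def aronhold(poly, num_coeffs):
    """Apply sum_i b_i d/da_i to a polynomial.

    Variables 0 .. num_coeffs-1 are the a_i; variables num_coeffs .. 2*num_coeffs-1
    are the corresponding b_i. A polynomial is {exponent tuple: coefficient}."""
    result = {}
    for exps, coeff in poly.items():
        for i in range(num_coeffs):
            if exps[i] == 0:
                continue
            new = list(exps)
            new[i] -= 1                      # differentiate with respect to a_i
            new[num_coeffs + i] += 1         # multiply by b_i
            key = tuple(new)
            result[key] = result.get(key, 0) + coeff * exps[i]
    return {k: v for k, v in result.items() if v != 0}

# Exponents are listed in the order (a, h, b, A, H, B).
discriminant = {(1, 0, 1, 0, 0, 0): 1,       #  ab
                (0, 2, 0, 0, 0, 0): -1}      # -h^2

first = aronhold(discriminant, 3)            # expected: aB + bA - 2hH
second = aronhold(first, 3)                  # expected: 2AB - 2H^2
```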

Definition of Invariant Process. If the effect of a process R
applied to a function of the original coefficients a is the same as
that of the process applied to the corresponding function of the new
coefficients, save for the factor $\phi$, then R is called an invariant
process.

Formula (38) is an example of this; and it will appear that 
polarization in general is an invariant process. 
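
The invariant process can be checked numerically in the simplest case. The following is a minimal sketch, not part of the original text: it assumes the binary quadratic $a_0x^2 + 2a_1xy + a_2y^2$ with discriminant $I(a) = a_0a_2 - a_1^2$, and a substitution $x = l_1x' + m_1y'$, $y = l_2x' + m_2y'$ whose numerical values are chosen arbitrarily. The function names are illustrative only.

```python
def transform_quadratic(coeffs, l1, m1, l2, m2):
    """Coefficients (a0, a1, a2) of a0*x^2 + 2*a1*x*y + a2*y^2 after the
    substitution x = l1*x' + m1*y', y = l2*x' + m2*y'."""
    a0, a1, a2 = coeffs
    new_a0 = a0 * l1 * l1 + 2 * a1 * l1 * l2 + a2 * l2 * l2
    new_a1 = a0 * l1 * m1 + a1 * (l1 * m2 + l2 * m1) + a2 * l2 * m2
    new_a2 = a0 * m1 * m1 + 2 * a1 * m1 * m2 + a2 * m2 * m2
    return (new_a0, new_a1, new_a2)


def discriminant(a):
    """The invariant I(a) = a0*a2 - a1^2."""
    a0, a1, a2 = a
    return a0 * a2 - a1 * a1


def aronhold_on_discriminant(b, a):
    """(b d/da) I(a), using dI/da0 = a2, dI/da1 = -2*a1, dI/da2 = a0."""
    a0, a1, a2 = a
    b0, b1, b2 = b
    return b0 * a2 - 2 * b1 * a1 + b2 * a0


# Arbitrary illustrative data (assumptions, not taken from the text).
l1, m1, l2, m2 = 2, 1, 3, 5
det_M = l1 * m2 - l2 * m1          # determinant of the substitution
a = (1, 2, -3)
b = (4, -1, 2)

a_new = transform_quadratic(a, l1, m1, l2, m2)
b_new = transform_quadratic(b, l1, m1, l2, m2)

# For this invariant the factor phi is the square of the determinant.
phi = det_M ** 2
print(discriminant(a_new) == phi * discriminant(a))
print(aronhold_on_discriminant(b_new, a_new) == phi * aronhold_on_discriminant(b, a))
```

Both comparisons use the same factor, which is the content of formula (38) in this special case.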


11. Multilinear Invariants. 


The invariant

\[ aB + bA - 2hH, \]

which is a bilinear form in the sets $a, h, b$ and $A, H, B$, could be
written

\[ \left( A \frac{\partial}{\partial a} + H \frac{\partial}{\partial h} + B \frac{\partial}{\partial b} \right) (ab - h^2), \]

as an illustration of the Aronhold process. Suppose, however,
we had a homogeneous invariant of degree $q$ in a set $a_0, a_1, \ldots, a_p$.
We write it

\[ I = I(a), \]

and proceed to operate with $\left( b \frac{\partial}{\partial a} \right)$. It produces an invariant
homogeneous in both $[a]$ and $[b]$ of degrees $q - 1$ and 1 respectively.
We could write it $I_1$, so that

\[ I_1(a', b') = \phi \cdot I_1(a, b) \]

identically for all values of $[a]$, $[b]$. Now we choose a third quantic

\[ (c_0, c_1, \ldots, c_p \between x, y)^p \]

and operate with $\left( c \frac{\partial}{\partial a} \right)$ on $I_1$, treating both $[c]$ and $[b]$ as independent
of the set $a_0, a_1, \ldots, a_p$. The result as before is an invariant,
this time linear in $[b]$, linear in $[c]$ and of degree $q - 2$ in $[a]$.
Thus

\[ I_2(a', b', c') = \phi \cdot I_2(a, b, c). \]

Proceeding in this way to $q$ operations involving $q$ different
quantics all of order $p$, we finally deduce a multilinear invariant

\[ I_q(b, c, \ldots, k) \]

involving $q$ sets of coefficients $[b], \ldots, [k]$ of $q$ different $p$-ics.





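Again, since the coefficients $b_0, \ldots, k_p$ have been taken quite
generally as independent of $a_0, \ldots, a_p$, the Aronhold operators
$\left( b \frac{\partial}{\partial a} \right)$, $\left( c \frac{\partial}{\partial a} \right)$, $\ldots$
are commutative. In fact

\[ \left( b \frac{\partial}{\partial a} \right) \left( c \frac{\partial}{\partial a} \right) I(a) = \left( c \frac{\partial}{\partial a} \right) \left( b \frac{\partial}{\partial a} \right) I(a). \]

It follows that $I_q$ is symmetrical in $[b]$, $[c]$ and therefore in all
of $b, c, \ldots, k$. This means that the $q!$ arrangements of
$b, c, \ldots, k$ are equivalent, so that

\[ I_q(b, c, \ldots, k) = I_q(c, b, \ldots, k) = \ldots \]

Further, by Euler's theorem for homogeneous forms

\[ \left( a \frac{\partial}{\partial a} \right) I(a) = \sum_{k=0}^{p} a_k \frac{\partial I}{\partial a_k} = q\, I(a). \]

Hence we infer that the result of putting $[b] = [a]$ in $\left( b \frac{\partial}{\partial a} \right) I(a)$
is merely to multiply $I(a)$ by $q$. Thus we have a useful theorem:

Every invariant of a binary $p$-ic $f$, homogeneous and of degree
$q$ in the coefficients of $f$, may be regarded as a special case of an
invariant at once linear and symmetrical in the $q$ sets of coefficients
of $q$ binary $p$-ics.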

EXAMPLES 


1. Form an invariant of two quartics $(a_0, a_1, a_2, a_3, a_4 \between x, y)^4$ and
$(b_0, b_1, b_2, b_3, b_4 \between x, y)^4$ linear in each.

Ans. $a_0b_4 - 4a_1b_3 + 6a_2b_2 - 4a_3b_1 + a_4b_0$.

2. Form an invariant linear in a quartic and of degree two in a
quadratic.

Hint. Consider the square of the quadratic as a quartic.
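
One way to reach the answer to Example 1 is by polarization. The worked step below is a sketch only; it assumes the familiar quadratic invariant of the quartic, $I(a) = a_0a_4 - 4a_1a_3 + 3a_2^2$, which is not stated in this section.

```latex
\[
\left( b \frac{\partial}{\partial a} \right) I(a)
  = b_0 \frac{\partial I}{\partial a_0} + b_1 \frac{\partial I}{\partial a_1}
  + b_2 \frac{\partial I}{\partial a_2} + b_3 \frac{\partial I}{\partial a_3}
  + b_4 \frac{\partial I}{\partial a_4}
  = b_0 a_4 - 4 b_1 a_3 + 6 b_2 a_2 - 4 b_3 a_1 + b_4 a_0 .
\]
```

Putting $[b] = [a]$ gives $2a_0a_4 - 8a_1a_3 + 6a_2^2 = 2I(a)$, in agreement with Euler's theorem for degree $q = 2$.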


12. Covariants. 

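Let us once more return to the binary quadratics

\[ U = ax^2 + 2hxy + by^2, \qquad V = Ax^2 + 2Hxy + By^2 \tag{41} \]

and form their Jacobian (§ 6, p. 124) or functional determinant

\[ J = \frac{\partial(U, V)}{\partial(x, y)} = \begin{vmatrix} \dfrac{\partial U}{\partial x} & \dfrac{\partial V}{\partial x} \\[2mm] \dfrac{\partial U}{\partial y} & \dfrac{\partial V}{\partial y} \end{vmatrix}, \tag{42} \]

introducing various useful notations. This is

\[ 4 \begin{vmatrix} ax + hy & Ax + Hy \\ hx + by & Hx + By \end{vmatrix} = 4(\alpha x^2 + 2\beta xy + \gamma y^2), \tag{43} \]

say, which is another quadratic. Similarly, with the accented
notation for the effect of the transformation $T : x \to x'$, we
have

\[ 4(\alpha' x'^2 + 2\beta' x'y' + \gamma' y'^2) = 4 \begin{vmatrix} a'x' + h'y' & A'x' + H'y' \\ h'x' + b'y' & H'x' + B'y' \end{vmatrix} = \begin{vmatrix} \dfrac{\partial U'}{\partial x'} & \dfrac{\partial V'}{\partial x'} \\[2mm] \dfrac{\partial U'}{\partial y'} & \dfrac{\partial V'}{\partial y'} \end{vmatrix} = J'. \tag{44} \]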

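But $U(x, y) = U'(x', y')$, $V(x, y) = V'(x', y')$. Accordingly

\[ \frac{\partial U'}{\partial x'} = \frac{\partial U}{\partial x} \frac{\partial x}{\partial x'} + \frac{\partial U}{\partial y} \frac{\partial y}{\partial x'} = l_1 \frac{\partial U}{\partial x} + l_2 \frac{\partial U}{\partial y} \]

and

\[ \frac{\partial U'}{\partial y'} = m_1 \frac{\partial U}{\partial x} + m_2 \frac{\partial U}{\partial y}. \]

Similarly for $V$. Substituting in (44) we find

\[ J' = \begin{vmatrix} l_1 \dfrac{\partial U}{\partial x} + l_2 \dfrac{\partial U}{\partial y} & l_1 \dfrac{\partial V}{\partial x} + l_2 \dfrac{\partial V}{\partial y} \\[2mm] m_1 \dfrac{\partial U}{\partial x} + m_2 \dfrac{\partial U}{\partial y} & m_1 \dfrac{\partial V}{\partial x} + m_2 \dfrac{\partial V}{\partial y} \end{vmatrix} = \begin{vmatrix} l_1 & l_2 \\ m_1 & m_2 \end{vmatrix} \begin{vmatrix} \dfrac{\partial U}{\partial x} & \dfrac{\partial V}{\partial x} \\[2mm] \dfrac{\partial U}{\partial y} & \dfrac{\partial V}{\partial y} \end{vmatrix} = |M|\, J. \]

This introduces us to a covariant of the given forms $U$, $V$,
namely a function of their coefficients, and their variables $x$, $y$,
which maintains itself after linear transformation, but for a factor
depending solely on the linear transformation.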


Definition of Covariant: In the notation already adopted, a
function $C$ of sets $[a]$, $[b]$, $\ldots$ of coefficients of different quantics
whose variables are $x_1$, $x_2$ is a covariant, if

\[ C(a', b', \ldots, x') = \phi(l_1, l_2, m_1, m_2)\, C(a, b, \ldots, x) \]

identically for all values of $a_0, a_1, \ldots, b_0, b_1, \ldots, x_1, x_2$.

In the above example C is the Jacobian, which is a function 
of eight arguments a,h,b,A,H,B, x, y. But in detail it is bilinear 
in the sets a, h, b and A, H, B, while being quadratic in the set 
of variables x, y. 

In what follows our chief concern is with rational integral 
homogeneous invariants and covariants. 
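
The relation $J' = |M|\,J$ lends itself to a direct numerical check. The sketch below is illustrative only: the coefficient values, the substitution and the function names are assumptions, and the Jacobian is evaluated from the determinant form (43).

```python
def transform_quadratic(coeffs, l1, m1, l2, m2):
    """Coefficients (a, h, b) of a*x^2 + 2*h*x*y + b*y^2 after the
    substitution x = l1*x' + m1*y', y = l2*x' + m2*y'."""
    a, h, b = coeffs
    return (
        a * l1 * l1 + 2 * h * l1 * l2 + b * l2 * l2,
        a * l1 * m1 + h * (l1 * m2 + l2 * m1) + b * l2 * m2,
        a * m1 * m1 + 2 * h * m1 * m2 + b * m2 * m2,
    )


def jacobian(U, V, x, y):
    """Jacobian of two binary quadratics at (x, y), as in (43)."""
    a, h, b = U
    A, H, B = V
    return 4 * ((a * x + h * y) * (H * x + B * y) - (h * x + b * y) * (A * x + H * y))


# Arbitrary illustrative data (assumptions, not taken from the text).
l1, m1, l2, m2 = 2, 1, 3, 5
det_M = l1 * m2 - l2 * m1
U = (1, 2, -3)
V = (4, -1, 2)
U_new = transform_quadratic(U, l1, m1, l2, m2)
V_new = transform_quadratic(V, l1, m1, l2, m2)

# A point given by its new co-ordinates, and the same point in the old ones.
x_new, y_new = 3, -2
x_old = l1 * x_new + m1 * y_new
y_old = l2 * x_new + m2 * y_new

J_old = jacobian(U, V, x_old, y_old)
J_new = jacobian(U_new, V_new, x_new, y_new)
print(J_new == det_M * J_old)
```

Here the factor is the first power of the determinant, whereas the discriminant of a single quadratic takes its square.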


13. Relation between Linear Forms and Covariants.

The simplest quantic to deal with is the linear form

\[ E = e_1x_1 + e_2x_2. \tag{45} \]

Presumably invariants exist involving this and other forms.
For example, it is easy to verify that

\[ a_0e_2^2 - 2a_1e_1e_2 + a_2e_1^2 \tag{46} \]

is an invariant of $E$ and the quadratic $a_0x_1^2 + 2a_1x_1x_2 + a_2x_2^2$.
In fact if $a \to a'$, $e \to e'$ denote the linear transformations, we
have, by (32),

\[ \begin{aligned} e_1' &= l_1e_1 + l_2e_2, & e_1 |M| &= m_2e_1' - l_2e_2', \\ e_2' &= m_1e_1 + m_2e_2, & e_2 |M| &= -m_1e_1' + l_1e_2'. \end{aligned} \tag{47} \]

But

\[ a_0x_1^2 + 2a_1x_1x_2 + a_2x_2^2 = a_0'x_1'^2 + 2a_1'x_1'x_2' + a_2'x_2'^2 \tag{48} \]

identically for all values of $x_1$, $x_2$. So, in particular, let

\[ x_1' = e_2', \qquad x_2' = -e_1'; \]

then

\[ \begin{aligned} x_1 &= l_1x_1' + m_1x_2' = l_1e_2' - m_1e_1' = e_2 |M|, \\ x_2 &= l_2x_1' + m_2x_2' = l_2e_2' - m_2e_1' = -e_1 |M|. \end{aligned} \tag{49} \]

Substitute in (48): then

\[ a_0'e_2'^2 - 2a_1'e_1'e_2' + a_2'e_1'^2 = |M|^2 (a_0e_2^2 - 2a_1e_1e_2 + a_2e_1^2), \tag{50} \]

which exhibits the invariant property.
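
Relation (50) can be confirmed with integers. The following sketch is an illustration under assumed data: the numerical coefficients and the helper names are hypothetical, and the substitution is $x_1 = l_1x_1' + m_1x_2'$, $x_2 = l_2x_1' + m_2x_2'$ as in (49).

```python
def transform_quadratic(coeffs, l1, m1, l2, m2):
    """New coefficients (a0', a1', a2') of a0*x1^2 + 2*a1*x1*x2 + a2*x2^2."""
    a0, a1, a2 = coeffs
    return (
        a0 * l1 * l1 + 2 * a1 * l1 * l2 + a2 * l2 * l2,
        a0 * l1 * m1 + a1 * (l1 * m2 + l2 * m1) + a2 * l2 * m2,
        a0 * m1 * m1 + 2 * a1 * m1 * m2 + a2 * m2 * m2,
    )


def transform_linear(e, l1, m1, l2, m2):
    """New coefficients (e1', e2') of the linear form e1*x1 + e2*x2, as in (47)."""
    e1, e2 = e
    return (l1 * e1 + l2 * e2, m1 * e1 + m2 * e2)


def joint_invariant(a, e):
    """The expression (46): a0*e2^2 - 2*a1*e1*e2 + a2*e1^2."""
    a0, a1, a2 = a
    e1, e2 = e
    return a0 * e2 * e2 - 2 * a1 * e1 * e2 + a2 * e1 * e1


# Arbitrary illustrative data (assumptions, not taken from the text).
l1, m1, l2, m2 = 2, 1, 3, 5
det_M = l1 * m2 - l2 * m1
a = (1, 2, -3)
e = (2, -1)

a_new = transform_quadratic(a, l1, m1, l2, m2)
e_new = transform_linear(e, l1, m1, l2, m2)

print(joint_invariant(a_new, e_new) == det_M ** 2 * joint_invariant(a, e))
```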

This feature is true in general; indeed any polynomial covariant
can be looked upon as an invariant of the linear form
$e_1x_1 + e_2x_2$. For if $C(a_i, x_1, x_2)$ denotes the covariant in question,
$a_i$ standing for the coefficients of the ground form (or forms),
then by the characteristic property

\[ C(a_i', x_1', x_2') = \phi\, C(a_i, x_1, x_2). \tag{51} \]

Since $x_1 = ty_1$, $x_2 = ty_2$ is a particular case of the linear
transformation, then

\[ C(a_i', y_1, y_2) = \phi\, C(a_i, ty_1, ty_2), \tag{52} \]

where $\phi$ depends solely on $t$. This implies that $C$ is homogeneous
in the variables $y_1$, $y_2$, as the contrary assumption at once is seen
to be impossible when applied to (52).

Hence (51) is homogeneous in $x_1$, $x_2$ and, let us say, of order
$m$. Using (49) this at once yields

\[ C(a_i', e_2', -e_1') = \phi\, C(a_i, e_2 |M|, -e_1 |M|) = \phi\, |M|^m\, C(a_i, e_2, -e_1). \]

This shows that $e_2$, $-e_1$ enter the function $C$ precisely as $x_1$, $x_2$
do, so that the function $C(a_i, e_2, -e_1)$ is an invariant of the
original ground forms together with the linear form $e_1x_1 + e_2x_2$.



CHAPTER IX 


The General Linear Transformation 


1. Cogredience and Contragredience. 

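The binary forms have served to introduce certain ideas
which can easily be generalized. We shall now be concerned with
forms in $n$ variables

\[ x = \{x_1, x_2, x_3, \ldots, x_n\}, \]

which undergo a linear transformation

\[ T_x : \quad \begin{aligned} x_1 &= \xi_1x_1' + \eta_1x_2' + \ldots + \omega_1x_n', \\ x_2 &= \xi_2x_1' + \eta_2x_2' + \ldots + \omega_2x_n', \\ &\;\;\vdots \\ x_n &= \xi_nx_1' + \eta_nx_2' + \ldots + \omega_nx_n', \end{aligned} \tag{1} \]

or $T_x : x \to x'$. Let $M$ denote the square matrix of coefficients
$\xi_1, \ldots, \omega_n$ and $|M|$ its determinant, so that

\[ M = \begin{bmatrix} \xi_1 & \eta_1 & \ldots & \omega_1 \\ \vdots & \vdots & & \vdots \\ \xi_n & \eta_n & \ldots & \omega_n \end{bmatrix}, \qquad |M| = \begin{vmatrix} \xi_1 & \eta_1 & \ldots & \omega_1 \\ \vdots & \vdots & & \vdots \\ \xi_n & \eta_n & \ldots & \omega_n \end{vmatrix}, \tag{2} \]

which must not vanish. The variables and coefficients may be
real or complex numbers.

Let the co-factor of $\xi_i$ in $|M|$ be $\Xi_i$, and the reciprocal of
$M$ be

\[ M^{-1} = \begin{bmatrix} \xi^1 & \xi^2 & \ldots & \xi^n \\ \eta^1 & \eta^2 & \ldots & \eta^n \\ \vdots & \vdots & & \vdots \\ \omega^1 & \omega^2 & \ldots & \omega^n \end{bmatrix}, \tag{3} \]

so that $\xi^i = \Xi_i / |M|$, &c.

If we multiply rows of (1) by $\xi^1, \xi^2, \ldots, \xi^n$ respectively and
add, we have

\[ x_1' = \xi^1x_1 + \xi^2x_2 + \ldots + \xi^nx_n, \]

and similarly

\[ \begin{aligned} x_2' &= \eta^1x_1 + \eta^2x_2 + \ldots + \eta^nx_n, \\ &\;\;\vdots \\ x_n' &= \omega^1x_1 + \omega^2x_2 + \ldots + \omega^nx_n. \end{aligned} \tag{4} \]

This set of equations, which forms the inverse transformation
of $T_x$, can be written

\[ T_x^{-1} : x' \to x. \tag{5} \]

It exists provided $|M|$ is non-zero.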

Now suppose we have a linear form

\[ (u \mid x) = u_1x_1 + u_2x_2 + \ldots + u_nx_n; \tag{6} \]

let us consider the effect of the linear transformation (1) upon
$(u \mid x)$. Manifestly it gives a form linear in the new set $[x']$; and
if we denote the new coefficients by $u_1', \ldots, u_n'$, we have

\[ (u \mid x) = (u_1\xi_1 + \ldots + u_n\xi_n)x_1' + \ldots + (u_1\omega_1 + u_2\omega_2 + \ldots + u_n\omega_n)x_n' = (u' \mid x'), \tag{7} \]

where the new coefficients $u'$ are linear functions of the old.
Accordingly we write

\[ \begin{aligned} u_1' &= \xi_1u_1 + \xi_2u_2 + \ldots + \xi_nu_n, \\ u_2' &= \eta_1u_1 + \eta_2u_2 + \ldots + \eta_nu_n, \\ &\;\;\vdots \\ u_n' &= \omega_1u_1 + \omega_2u_2 + \ldots + \omega_nu_n. \end{aligned} \tag{8} \]

Here we have another instance of an induced linear transformation
$u' \to u$, where, it will be noticed, the coefficient matrix is
the transposed of matrix $M$ in (1). We therefore write it $M'$.
By solving this set (8) we obtain the direct transformation
$u \to u'$, which shall be denoted by $T_u$, so that its inverse $T_u^{-1}$
denotes (8). Thus

\[ T_u : \quad \begin{aligned} u_1 &= \xi^1u_1' + \eta^1u_2' + \ldots + \omega^1u_n', \\ &\;\;\vdots \\ u_n &= \xi^nu_1' + \eta^nu_2' + \ldots + \omega^nu_n'. \end{aligned} \tag{9} \]



In this way we arrive at four transformations $T_x$, $T_x^{-1}$, $T_u$, $T_u^{-1}$
as stated in (1), (4), (9), (8), whose matrices are $M$, $M^{-1}$, $M'^{-1}$,
$M'$ respectively. When two such sets $[x]$ and $[u]$ undergo such
transformations, (1) and (9), they are called¹ contragredient sets,
and the same name is given to the corresponding transformations
$T_x$, $T_u$. Further if $y_1, y_2, \ldots, y_n$ is another set of variables which
undergoes the same transformation as $x_1, x_2, \ldots, x_n$, namely

\[ y_i = \xi_iy_1' + \eta_iy_2' + \ldots + \omega_iy_n' \qquad (i = 1, 2, \ldots, n), \tag{10} \]

then the sets $[x]$ and $[y]$ are called cogredient.

The simplest formal definition of cogrediency and contragrediency
is to take them as follows:

Two sets of $n$ variables $[x]$ and $[y]$ are cogredient if a linear
transformation of matrix $M$ for $x \to x'$ induces the transformation
$y \to y'$ with the same matrix. Two sets $[x]$ and $[u]$ are contragredient
if, when $x \to x'$ and $u \to u'$, the inner product $u_x$ remains an
absolute invariant, namely

\[ u_1x_1 + \ldots + u_nx_n = u_1'x_1' + \ldots + u_n'x_n', \tag{11} \]

or simply $(u \mid x) = (u' \mid x')$.

Starting from this fundamental condition, which must hold 
identically, we can at once deduce equations (8) from equations 
(1) by substituting in (11); or conversely. 
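
The fundamental condition (11) is easily tested for a particular matrix. The sketch below is an illustration only, with an arbitrarily chosen non-singular $M$ for $n = 3$: the point set follows (1), that is $x = Mx'$, and the coefficient set follows (8), that is $u'$ is obtained from $u$ by the transposed matrix. The names are hypothetical.

```python
def dot(u, v):
    """Inner product (u | v)."""
    return sum(p * q for p, q in zip(u, v))


def apply(M, v):
    """Matrix M acting on the column v."""
    return [dot(row, v) for row in M]


def transpose(M):
    return [list(col) for col in zip(*M)]


# Arbitrary non-singular matrix and sets (assumptions, not from the text).
M = [[2, 1, 0],
     [1, 3, 1],
     [0, 1, 4]]
x_new = [1, -2, 3]      # the set x'
u = [5, 7, -1]          # the set u

x = apply(M, x_new)              # equations (1): x in terms of x'
u_new = apply(transpose(M), u)   # equations (8): u' in terms of u

print(dot(u, x), dot(u_new, x_new))
```

The two printed numbers agree, which is the statement $(u \mid x) = (u' \mid x')$.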

2. Linear Transformations in Matrix Notation. 

Let $U$ denote the single-row matrix

\[ [u_1, u_2, \ldots, u_n] \tag{12} \]

and $X$ the single-column matrix whose transposed is

\[ X' = [x_1, x_2, \ldots, x_n]. \tag{13} \]

Let

\[ \bar{X}' = [x_1', x_2', \ldots, x_n']. \tag{14} \]

Then the general linear transformation (1) is a direct example
of the product of matrices, and can be written

\[ T_x : \quad X = M\bar{X}, \tag{15} \]

as is immediately apparent when it is written in full. Next let
cogredient sets be denoted by single-column matrices $X$, $Y$,
$Z, \ldots$. If they transform to $\bar{X}$, $\bar{Y}$, $\bar{Z}, \ldots$, then by definition
of cogredient sets

\[ X = M\bar{X}, \quad Y = M\bar{Y}, \quad Z = M\bar{Z}. \tag{16} \]

We deduce by fore-multiplication with $M^{-1}$ that

\[ M^{-1}X = \bar{X}, \quad M^{-1}Y = \bar{Y}, \quad M^{-1}Z = \bar{Z}. \tag{17} \]

¹ Sylvester first developed this theory, and gave these names to the sets $[u]$, $[x]$.
Cf. Cambridge and Dublin Mathematical Journal, VI, VII, VIII, IX (1851-4).

Again, by (11), the contragredient sets $U$, $X$ satisfy the
identical condition between two inner products, which in matrix
notation is

\[ UX = \bar{U}\bar{X}. \tag{18} \]

Hence by (15),

\[ UM\bar{X} = \bar{U}\bar{X}, \]

identically, so that

\[ UM = \bar{U}, \tag{19} \]

which is the matrix equation for (8). Solving this we have

\[ U = \bar{U}M^{-1}, \tag{20} \]

which is the set of equations (9). By (9), p. 70, we may
transpose these last results to

\[ \bar{U}' = M'U', \qquad U' = M'^{-1}\bar{U}', \]

giving the same actual equations when written in full.

If $V$ is a set cogredient with $U$, then by (19)

\[ \bar{V} = VM, \]

whence

\[ VX = VM\bar{X} = \bar{V}\bar{X}, \]

so that $V$ is contragredient with $X$.

Thus we arrive at the conception of a number of matrices
or vectors of the first kind $X, Y, Z, \ldots$, and a number of
matrices or vectors of the second kind $U, V, W, \ldots$, such that
vectors of the same kind are cogredient with each other and
vectors of different kinds are contragredient. All such matrices
have rank unity (or else zero), for they each consist of a single
row or column. Sometimes they are called tensors of the first
rank, or order (cf. p. 91).




The two chief applications of this co- and contra-gredience
are first in geometry and secondly in analysis, as the following
merely preliminary examples are designed to show.


EXAMPLES 


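1. If $n = 3$, $X$ may represent a point whose homogeneous co-ordinates
referred to a triangle in a plane are $x_1, x_2, x_3$. If $u_1, u_2, u_3$ are homogeneous
line co-ordinates the equation of a straight line is

\[ u_1x_1 + u_2x_2 + u_3x_3 = 0. \]

If a new triangle of reference is chosen such that $x_1', x_2', x_3'$ are the
co-ordinates of the same point as before, and $u_1', u_2', u_3'$ those of the same
line as before, then the characteristic contragredient condition $u_x = u'_{x'}$
is satisfied. Hence in a plane, homogeneous line and point co-ordinates are
contragredient sets.

2. Sets of co-ordinates of coplanar points $X, Y, Z, \ldots$ are cogredient.

3. Sets of co-ordinates of coplanar lines $U, V, W, \ldots$ are cogredient.

4. Points and planes in threefold space are contragredient. [Here
$n = 4$.]

5. If $\phi$ is a function of 2 variables $x$, $y$ and $x = r\cos\theta$, $y = r\sin\theta$, prove
that the set $[dx, dy]$ is contragredient to $\left[ \dfrac{\partial\phi}{\partial x}, \dfrac{\partial\phi}{\partial y} \right]$ relative to the linear
transformation of differentials from $[dx, dy]$ to $[dr, d\theta]$.

For let $\phi(x, y) = \phi'(r, \theta) = \phi$.

Then

\[ d\phi = \frac{\partial\phi}{\partial x}dx + \frac{\partial\phi}{\partial y}dy = \frac{\partial\phi}{\partial r}dr + \frac{\partial\phi}{\partial\theta}d\theta. \]

Also

\[ \begin{aligned} dx &= \cos\theta\, dr - r\sin\theta\, d\theta, \\ dy &= \sin\theta\, dr + r\cos\theta\, d\theta. \end{aligned} \]

These last give a linear transformation, of modulus $r$, for the differentials
$dx$, $dy$ in terms of $dr$, $d\theta$. The proof is now immediate.

6. Write down the induced linear transformation of $\left[ \dfrac{\partial\phi}{\partial x}, \dfrac{\partial\phi}{\partial y} \right]$.

\[ \left[ \frac{\partial\phi}{\partial r} = \cos\theta \frac{\partial\phi}{\partial x} + \sin\theta \frac{\partial\phi}{\partial y}, \qquad \frac{\partial\phi}{\partial\theta} = -r\sin\theta \frac{\partial\phi}{\partial x} + r\cos\theta \frac{\partial\phi}{\partial y}. \right] \]

7. If $x = x(p, q)$, $y = y(p, q)$ prove that $[dx, dy]$ and $\left[ \dfrac{\partial\phi}{\partial x}, \dfrac{\partial\phi}{\partial y} \right]$ are still
contragredient, relative to the linear transformation $[dx, dy] \to [dp, dq]$,
provided the Jacobian $\dfrac{\partial(x, y)}{\partial(p, q)}$ neither vanishes nor is
infinite.

[This Jacobian is the determinant of the linear transformation.]

8. Generalize this for $n$ variables, $x_1, x_2, \ldots, x_n$, proving that
$[dx_1, dx_2, \ldots, dx_n]$ and $\left[ \dfrac{\partial\phi}{\partial x_1}, \dfrac{\partial\phi}{\partial x_2}, \ldots, \dfrac{\partial\phi}{\partial x_n} \right]$ are contragredient sets.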




3. Orthogonal Transformations and Matrices. 

We gain a clearer idea of cogredience and contragredience of
sets of variables by considering a particular case in which the
distinction breaks down. It is called the orthogonal transformation,
an example of which has already been considered (§2,
p. 130). But the general orthogonal case is most fruitfully developed
by starting with the characteristic property of contragredience of
two sets $(u)$ and $(x)$ and seeking to make it hold of a single set
$(x)$ with itself.

Let $X = AY$ be the linear transformation with non-singular
matrix $A$ for a column set $X = \{x_1, x_2, \ldots, x_n\}$ in terms of
another such set $Y = \{y_1, y_2, \ldots, y_n\}$. Then by transposition

\[ X' = [x_1, x_2, \ldots, x_n], \qquad Y' = [y_1, y_2, \ldots, y_n] \tag{21} \]

and, for the inner products,

\[ \begin{aligned} X'X &= x_1^2 + x_2^2 + \ldots + x_n^2 = (x \mid x), \\ Y'Y &= y_1^2 + y_2^2 + \ldots + y_n^2 = (y \mid y). \end{aligned} \tag{22} \]

Definition of Orthogonal Transformation. The homogeneous
linear transformation from $x_1, x_2, \ldots, x_n$ to $y_1, y_2, \ldots, y_n$ is
orthogonal if the condition

\[ x_1^2 + x_2^2 + \ldots + x_n^2 = y_1^2 + y_2^2 + \ldots + y_n^2 \tag{23} \]

is identically satisfied by performing this transformation.

This condition can be written in either of the equivalent
forms

\[ X'X = Y'Y, \qquad (x \mid x) = (y \mid y). \tag{24} \]


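To fix our ideas, let the typical case when $n = 3$ be taken.
Then we suppose that the following transformation is orthogonal:

\[ \begin{aligned} x_1 &= a_1y_1 + b_1y_2 + c_1y_3, \\ x_2 &= a_2y_1 + b_2y_2 + c_2y_3, \\ x_3 &= a_3y_1 + b_3y_2 + c_3y_3, \end{aligned} \tag{25} \]

also written

\[ \begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix} = \begin{bmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{bmatrix} \begin{bmatrix} y_1 \\ y_2 \\ y_3 \end{bmatrix} \tag{26} \]

or simply

\[ X = AY. \]

Thus, if when $n = 3$ the substitutions (25) are made in (23),
the result is a quadratic condition involving terms in
$y_1^2$, $y_2^2$, $y_3^2$, $y_2y_3$, $y_3y_1$, $y_1y_2$. Since this is true for all values of $y_1, y_2, y_3$
the coefficients of these six quadratic terms must vanish identically.
This gives

\[ \begin{aligned} a_1^2 + a_2^2 + a_3^2 &= 1, & b_1c_1 + b_2c_2 + b_3c_3 &= 0, \\ b_1^2 + b_2^2 + b_3^2 &= 1, & c_1a_1 + c_2a_2 + c_3a_3 &= 0, \\ c_1^2 + c_2^2 + c_3^2 &= 1, & a_1b_1 + a_2b_2 + a_3b_3 &= 0, \end{aligned} \tag{27} \]

which is completely specified by the matrix equation

\[ \begin{bmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{bmatrix} \begin{bmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{bmatrix} = \begin{bmatrix} 1 & \cdot & \cdot \\ \cdot & 1 & \cdot \\ \cdot & \cdot & 1 \end{bmatrix} \tag{28} \]

or simply

\[ A'A = I, \tag{29} \]

which is also true for all values of $n$.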

This last result characterizes the orthogonal matrix, namely
the product of an orthogonal matrix $A$ and its transposed $A'$ is the
unit matrix.

Further if $A'A = I$, then the product of the corresponding
determinants gives

\[ |A'|\,|A| = |A|^2 = 1, \tag{30} \]

so that the determinant $|A|$ is $\pm 1$. In either case the inverse
$A^{-1}$ exists, for the matrix is non-singular.

Now

\[ AA'A = A(A'A) = AI = A. \]

Hence by after-multiplication

\[ AA'AA^{-1} = AA^{-1} = I, \]

so that $AA' = I$: hence an orthogonal matrix commutes with its
transposed, and

\[ A'A = AA' = I. \tag{31} \]



Conversely, if $A'A = I$, we deduce the original property (24) of
the sets $X$, $Y$. For if

\[ X = AY, \qquad X' = Y'A', \]

then

\[ X'X = (Y'A')(AY) = Y'A'AY = Y'IY = Y'Y, \]

which exhibits the required property (24).

If we expand the result (31) to its full implication, when
$n = 3$, we obtain the six equations (27), together with a further
six due to transposition. Thus we interchange the sets of suffixes
and letters in (27) and obtain

\[ \begin{aligned} a_1^2 + b_1^2 + c_1^2 &= 1, & a_2a_3 + b_2b_3 + c_2c_3 &= 0, \\ a_2^2 + b_2^2 + c_2^2 &= 1, & a_3a_1 + b_3b_1 + c_3c_1 &= 0, \\ a_3^2 + b_3^2 + c_3^2 &= 1, & a_1a_2 + b_1b_2 + c_1c_2 &= 0. \end{aligned} \tag{32} \]

Similarly, for $n$ rows and columns the conditions (32) imply
conditions (27), and conversely. Counting the number of such
equations (27) in general, the number of necessary and sufficient
conditions for $A$ to be orthogonal is

\[ n + \binom{n}{2} = \frac{n(n+1)}{2}. \]

Since $A$ has $n^2$ elements, there are therefore $\dfrac{n(n-1)}{2}$ arbitrary
constants involved in an orthogonal matrix.

A simple way to remember the conditions (27) or (32) is this:
the inner product of two different rows (or two different columns)
of the orthogonal matrix is zero; that of a row or column with itself
is unity.

It will now be seen that the binary illustration of §2, p. 130,
fits in with this general treatment of the orthogonal matrix.
Further, since $|A| = \pm 1$ the notion of right- and left-handedness
can also be attached to a general orthogonal transformation.
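
The row and column rule can be tried on a matrix with rational elements. The sketch below is illustrative only; the particular matrix is an assumption chosen for convenience and is not taken from the text.

```python
from fractions import Fraction
from itertools import combinations


def dot(u, v):
    return sum(p * q for p, q in zip(u, v))


def is_orthonormal(vectors):
    """Each vector has unit inner product with itself, zero with the others."""
    norms_ok = all(dot(v, v) == 1 for v in vectors)
    pairs_ok = all(dot(u, v) == 0 for u, v in combinations(vectors, 2))
    return norms_ok and pairs_ok


def det3(P):
    """Determinant of a three-rowed matrix by expansion along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = P
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


# An assumed example: one third of an integer matrix.
A = [[Fraction(v, 3) for v in row]
     for row in ([2, -2, 1], [1, 2, 2], [2, 1, -2])]
n = len(A)

rows = A
cols = [list(c) for c in zip(*A)]

# n conditions of the first kind and n(n-1)/2 of the second.
num_conditions = n + len(list(combinations(range(n), 2)))

print(is_orthonormal(rows), is_orthonormal(cols))
print(num_conditions, det3(A))
```

For this matrix the determinant is $-1$, so that it belongs to the left-handed class.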


EXAMPLES 

1. If $A$ is an orthogonal matrix, so is $A'$: and so also are $A^{-1}$ and $A'^{-1}$.

2. $\begin{bmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{bmatrix}$ is orthogonal.





3. If $x = x'\cos\theta - y'\sin\theta$, $y = x'\sin\theta + y'\cos\theta$, the consequent
orthogonal matrix characterizes the rotation of rectangular Cartesian axes
through an angle $\theta$. Its determinant is unity.

4. Show that $\begin{bmatrix} \cos\theta & \sin\theta \\ \sin\theta & -\cos\theta \end{bmatrix}$ characterizes a change of axes obtained
by rotation through an angle $\theta$ followed by reversal of the axis of $y'$. Its
determinant is $-1$.

5. The ternary ($n = 3$) orthogonal transformation, when $|A| = 1$,
characterizes a change of rectangular Cartesian axes with fixed origin,
obtained by suitable rotation.

[The matrix $A$ gives direction cosines of old axes referred to new, or
vice versa.]

6. If $|A| = -1$ the change of axes involves reflexion together with
rotation.

7. An orthogonal transformation, when $|A| = 1$, also characterizes a
movement of a rigid body about a fixed pivot.

For if $P$, $Q$, $X$, $Y$ are column matrices, such that $P = AX$, $Q = AY$,
then $P'P = X'X$, $Q'Q = Y'Y$, $P'Q = X'Y$. And if these are interpreted
geometrically for rectangular Cartesian axes, $P'P$ means the square of the
distance of a point $P$ from the origin, while $P'Q$ gives $OP \cdot OQ \cos POQ$.
Hence the matrix conditions show that triangles $POQ$, $XOY$ are congruent.

4. Cayley’s Determination of the Orthogonal Matrix whose 
Determinant is Positive. 

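Let $S$ be a general skew symmetric matrix and $L$ be the sum
of $S$ and the unit matrix $I$, so that if $n = 3$, we write

\[ S = \begin{bmatrix} 0 & c & -b \\ -c & 0 & a \\ b & -a & 0 \end{bmatrix}, \qquad L = \begin{bmatrix} 1 & c & -b \\ -c & 1 & a \\ b & -a & 1 \end{bmatrix}, \tag{33} \]

and in general

\[ L = I + S, \qquad L' = I + S' = I - S. \tag{34} \]

Also let $X$, $Y$, $Z$ be the column matrices of three sets of variables

\[ \{x_1, x_2, \ldots, x_n\}, \quad \{y_1, y_2, \ldots, y_n\}, \quad \{z_1, z_2, \ldots, z_n\}. \]

Then if $X = LZ$ and $Y = L'Z$, the direct transformation from
$X$ to $Y$ is orthogonal, and yields the general orthogonal matrix,
whose determinant is positive, with the $\tfrac{1}{2}n(n - 1)$ elements of a
skew symmetric matrix $S$ for its arbitrary constants.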



In effect this is the theorem of Cayley¹; nor is it difficult
to prove. To fix our ideas let the conditions be written in full,
when $n = 3$; namely $X = LZ$, $Y = L'Z$ become

\[ \begin{aligned} x_1 &= z_1 + cz_2 - bz_3, & y_1 &= z_1 - cz_2 + bz_3, \\ x_2 &= -cz_1 + z_2 + az_3, & y_2 &= cz_1 + z_2 - az_3, \\ x_3 &= bz_1 - az_2 + z_3, & y_3 &= -bz_1 + az_2 + z_3. \end{aligned} \tag{35} \]

According to this theorem the effect of solving for the set $z$ in
terms of $x$ and substituting in the set of equations for $y$, will
give us an orthogonal transformation from $y$ to $x$. In fact
since $X = LZ$, therefore $X' = Z'L'$, so that $X'X = Z'L'LZ$.

Similarly $Y'Y = Z'LL'Z$.

But

\[ LL' = (I + S)(I + S') = (I + S)(I - S) \]

and

\[ L'L = (I + S')(I + S) = (I - S)(I + S), \]

each of which is equal to $I - S^2$, so that $LL' = L'L$.
Hence $X'X = Y'Y$, which proves the orthogonal property.

Further, we have $Y = L'Z$, so $Z = L'^{-1}Y$, provided $L'$ is
non-singular. Hence

\[ X = LZ = LL'^{-1}Y. \tag{36} \]

Since $L$ commutes with $L'$, it commutes with $L'^{-1}$: hence $LL'^{-1}$
can be written without ambiguity as

\[ \frac{L}{L'} = \frac{I + S}{I - S}. \tag{37} \]

Thus the matrix $A$, which can be put into the form

\[ \frac{I + S}{I - S}, \]

where $S$ is an arbitrary skew symmetric matrix, is orthogonal.

There is no difficulty in calculating $A$, since $(I + S)(I - S)^{-1}$
is given by $L$ and the inverse of $L'$. In the case when $n = 3$ we find

\[ A = \frac{1}{1 + a^2 + b^2 + c^2} \begin{bmatrix} 1 + a^2 - b^2 - c^2 & 2(ab + c) & 2(ac - b) \\ 2(ab - c) & 1 - a^2 + b^2 - c^2 & 2(bc + a) \\ 2(ac + b) & 2(bc - a) & 1 - a^2 - b^2 + c^2 \end{bmatrix} \tag{38} \]

and the orthogonal transformation is

\[ \begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix} = A \begin{bmatrix} y_1 \\ y_2 \\ y_3 \end{bmatrix}. \]

¹ Crelle, 32 (1846), 119-123; Collected Works, 1, 332-336.

These formulae written in full are known as Rodrigues' equations.¹
They were also known to Euler (1770). Their chief interest is
that they give a rational solution of the problem, the simplest
case, when $n = 2$, being familiar in the form of finding rational
lengths for the sides of a right-angled triangle.

To obtain, by any other means, a set of rational values of 
direction cosines of three mutually perpendicular lines in space 
referred to rectangular Cartesian axes is a difficult problem, as 
an attempt will readily show. 
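
Cayley's construction is readily carried out in exact rational arithmetic. The following is a minimal sketch, with an assumed choice $a = 1$, $b = 2$, $c = 3$ and hypothetical helper names: it forms $(I + S)(I - S)^{-1}$ by Gauss-Jordan elimination, compares it with the matrix (38), and tests the orthogonal property.

```python
from fractions import Fraction


def matmul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]


def transpose(P):
    return [list(col) for col in zip(*P)]


def identity(n):
    return [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]


def inverse(P):
    """Inverse by Gauss-Jordan elimination over the rationals."""
    n = len(P)
    aug = [[Fraction(v) for v in row] + identity(n)[i] for i, row in enumerate(P)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def cayley(S):
    """The matrix (I + S)(I - S)^(-1) for a skew symmetric S."""
    n = len(S)
    I = identity(n)
    L = [[I[i][j] + S[i][j] for j in range(n)] for i in range(n)]
    L_t = [[I[i][j] - S[i][j] for j in range(n)] for i in range(n)]
    return matmul(L, inverse(L_t))


# Assumed values of the three arbitrary constants.
a, b, c = Fraction(1), Fraction(2), Fraction(3)
S = [[0, c, -b], [-c, 0, a], [b, -a, 0]]
A = cayley(S)

d = 1 + a * a + b * b + c * c
rodrigues = [
    [(1 + a*a - b*b - c*c) / d, 2 * (a*b + c) / d, 2 * (a*c - b) / d],
    [2 * (a*b - c) / d, (1 - a*a + b*b - c*c) / d, 2 * (b*c + a) / d],
    [2 * (a*c + b) / d, 2 * (b*c - a) / d, (1 - a*a - b*b + c*c) / d],
]

print(A == rodrigues)
print(matmul(transpose(A), A) == identity(3))
```

With these values every element of $A$ is a rational number with denominator 15, which illustrates the rational solution referred to above.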


EXAMPLES 

r 

1 . Verify that A — \\ —2 1 2 gives an orthogonal transformation. 

L-2-2-1J 

2. Prove that if $A$ is the Cayley orthogonal matrix, then $|A| = 1$.

3. If $p$ rows or $p$ columns of the Cayley matrix are multiplied by $-1$,
the result is an orthogonal matrix whose determinant is $\pm 1$ according as
$p$ is even or odd.

[Apply the detailed test as in (37).]

4. If $J$ denotes the unit matrix with $p$ negative and $n - p$ positive
signs attached to diagonal elements, then $J(I + S)/(I - S)$ is the general
orthogonal matrix, whose determinant is $\pm 1$ according as $p$ is even or odd.

¹ Rodrigues, Journ. de Liouville de Math., 5, 404-405.





5. Linear Transformation with Absolute Quadric. 

An important corollary follows, which concerns the general
quadratic relation

\[ \sum a_{ij}x_ix_j = \sum a_{ij}y_iy_j, \tag{39} \]

analogous to the simpler case already taken. Can a linear
transformation of variables

\[ X' = [x_1, x_2, \ldots, x_n], \qquad Y' = [y_1, y_2, \ldots, y_n] \tag{40} \]

from $X$ to $Y$ be found, such that the above quadratic relation
is identically satisfied? The answer is given by use of a
symmetrical matrix $Q = [a_{ij}] = [a_{ji}]$; for the quadratic itself may be
denoted by the matrix product

\[ X'QX. \tag{41} \]

For example,

\[ [x_1, x_2, x_3] \begin{bmatrix} a & h & g \\ h & b & f \\ g & f & c \end{bmatrix} \begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix} = ax_1^2 + bx_2^2 + cx_3^2 + 2fx_2x_3 + 2gx_3x_1 + 2hx_1x_2. \tag{42} \]

Our condition is now $X'QX = Y'QY$, and this is secured by
taking as our linear transformation

\[ X = \frac{I + SQ}{I - SQ}\, Y, \tag{43} \]

$S$ being an arbitrary skew symmetric matrix. For by transposition

\[ X' = Y'\, \frac{I - QS}{I + QS}, \tag{44} \]

since $Q' = Q$, $S' = -S$. Hence $X'QX = Y'QY$ provided

\[ \frac{I - QS}{I + QS}\, Q\, \frac{I + SQ}{I - SQ} = Q. \tag{45} \]

As in (37) the fractional notation is unambiguous. Multiplying
fore and aft by $I + QS$ and $I - SQ$ respectively we have

\[ (I - QS)\,Q\,(I + SQ) = (I + QS)\,Q\,(I - SQ), \tag{46} \]

which is true on expansion without the commutative law of
multiplication.



This matrix $\dfrac{I + SQ}{I - SQ}$ is a function of a single argument $SQ$.

Another matrix which has the same property, but which is
not a function of one argument, for the order of its factors
is non-commutative, is given by the following theorem.

Hermite's Theorem. The matrix $(Q + S)^{-1}(Q - S)$ gives
rise to a linear transformation which leaves the quadric $X'QX$
unchanged.

Proof.

Let $R = (Q + S)^{-1}(Q - S)$, so that, by the reversal law, its
transposed is

\[ R' = (Q + S)(Q - S)^{-1}. \tag{47} \]

Hence

\[ R'(Q + S)R = (Q + S)(Q - S)^{-1}(Q + S)(Q + S)^{-1}(Q - S) = Q + S, \tag{48} \]

after cancelling the third and fourth factors and then the second
and fifth. Similarly

\[ R'(Q - S)R = Q - S. \tag{49} \]

Adding these results we have

\[ R'(Q + S + Q - S)R = 2Q \]

or

\[ R'QR = Q. \tag{50} \]

By subtraction,

\[ R'SR = S. \tag{51} \]

Hence as in (45) the requisite condition is satisfied, so proving
the theorem.

Corollary I. The same matrix leaves the skew symmetric
bilinear form

\[ X'SY = \sum x_is_{ij}y_j \qquad (s_{ij} = -s_{ji}) \]

unchanged, as is indicated by (51).

Corollary II. The pencil of bilinear forms
whose matrix is $\lambda Q + \mu S$ is also left unchanged. For by (50)
and (51), if $\lambda$ and $\mu$ are scalar,

\[ R'(\lambda Q + \mu S)R = \lambda Q + \mu S. \]

6. Group of the Orthogonal Matrix. 

Theorem. The product of two orthogonal $n$-rowed matrices is
orthogonal. In fact, if $A$ and $B$ are each orthogonal $n$-rowed
matrices, then

\[ AA' = BB' = A'A = B'B = I. \]

Hence

\[ ABB'A' = AIA' = AA' = I. \]

Thus if $C = AB$, then $C' = B'A'$ and $CC' = ABB'A' = I$,
which proves the theorem.

This result is obvious geometrically by interpreting each 
matrix as a suitable rotation of axes about a fixed origin. 
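
A short numerical illustration of the theorem may be given with two plane rotations having rational elements. The matrices below are assumptions, built from the triangles 3, 4, 5 and 5, 12, 13, and the names are illustrative.

```python
from fractions import Fraction


def matmul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]


def transpose(P):
    return [list(col) for col in zip(*P)]


def is_orthogonal(P):
    """True when P'P and PP' are both the unit matrix."""
    n = len(P)
    unit = [[int(i == j) for j in range(n)] for i in range(n)]
    return matmul(transpose(P), P) == unit and matmul(P, transpose(P)) == unit


# Two assumed orthogonal matrices with rational elements.
A = [[Fraction(3, 5), Fraction(-4, 5)],
     [Fraction(4, 5), Fraction(3, 5)]]
B = [[Fraction(5, 13), Fraction(-12, 13)],
     [Fraction(12, 13), Fraction(5, 13)]]

C = matmul(A, B)

print(is_orthogonal(A), is_orthogonal(B), is_orthogonal(C))
# The inverse of an orthogonal matrix is its transposed.
print(matmul(A, transpose(A)) == [[1, 0], [0, 1]])
```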

Such a result typifies a property of fundamental importance
throughout mathematics, namely the group property. In its
general form the group is defined as follows.¹

Definition of a Group. A system consisting of a class of
elements $A, B, C, \ldots$, and one rule of combination, which will be
denoted by $\circ$, is called a group if the following conditions are satisfied:

(1) If $A$ and $B$ are members of the class, whether distinct or
not, $A \circ B$ is also a member of the class.

(2) The associative law holds, namely

\[ (A \circ B) \circ C = A \circ (B \circ C). \]

(3) The class contains a member $I$ called the identical element,
which is such that every member is unchanged when combined with
it: thus

\[ A \circ I = I \circ A = A. \]

(4) Answering to each member $A$ is a member $A^{-1}$ called the
inverse of $A$, such that

\[ A \circ (A^{-1}) = (A^{-1}) \circ A = I. \]

Simple examples of groups, which obey these conditions, will
at once occur to the reader. Positive and negative integers with
zero form a group, if the rule of combination is addition. In
this case the inverse of $A$ is $-A$, while $I = 0$. So the class of
integers is a group for the operation addition. But not so for
subtraction; nor even for multiplication, since condition (4)
breaks down.

¹ Cf. Bôcher, Higher Algebra (New York, 1919), p. 82.

Non-singular matrices of the same order form a group for
multiplication, the identical element being $I$.

The totality of all displacements of a rigid plane lamina in
its own plane form a group, if we allow the null displacement (no
displacement at all) to act as the identical element of the group.

Again, the totality of all linear transformations, from a set
$X$ of $n$ variables to a set $Y$, form a group, provided the matrix
$M$ of the transformation is non-singular. Thus if $X = MY$ is a
transformation $X \to Y$, and $Y = NZ$ is another linear
transformation, then

\[ X = MY = M(NZ) = (MN)Z. \]

Hence $(MN)$, the product of the matrices $M$, $N$, determines the
linear transformation from $X$ direct to $Z$ (§2, p. 59). We may
evidently speak of the transformation $M$ (or $N$, or $MN$), meaning
that for which $M$ is its coefficient matrix, as in §1. So if
$|M|$, $|N|$ are non-zero, the inverse transformations exist, and
group condition (4) is satisfied. If in particular $N = M^{-1}$, then

\[ X = MM^{-1}Z = IZ, \]

which gives the identical transformation of the group, namely $X = Z$.

7. Dimensions of the Transformation Group. 

Consider the two matrices $M$ and $N$, each with $n^2$ elements,
not entirely alike, so that $M \neq N$. Then $MY \neq NY$, so that
we may properly speak of the transformations

\[ X = MY, \qquad X = NY \]

as distinct. For they give different values of the set $X$ answering
to one value of the set $Y$. Thus the transformation $M$ is said
to have $n^2$ dimensions, for it is not specified unless all its $n^2$
elements are given, whereas these $n^2$ elements determine it
uniquely.

Definition of Subgroup. A subgroup of a group is an
aggregate of members which themselves form a group, with the
same rule of combination.

For our purpose the following examples are important. 


The Projective Subgroup. 

All matrices $\rho M, \sigma M, \tau M, \ldots$ where $\rho, \sigma, \tau, \ldots$ are non-zero
scalar factors are members of a group. Each is obtained
from another by scalar multiplication, and all the group conditions
are satisfied. Applied to a linear transformation $X \to Y$ such
matrices differ merely by multiplying the co-ordinates $y_1, y_2, \ldots, y_n$
by a constant factor. Now this difference is immaterial for
homogeneous co-ordinates in geometry: accordingly these
transformations are indistinguishable. So if the $n^2 - 1$ ratios
$\xi_1 : \xi_2 : \ldots : \omega_n$ of the elements of $M$ are given, one such
transformation is determined. The totality of these transformations
forms the projective group of $(n - 1)$-fold space, and
therefore its group dimensions are $n^2 - 1$.


The Affine Group. 

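The matrix

\[ M_1 = \begin{bmatrix} \xi_1 & \ldots & \psi_1 & \omega_1 \\ \vdots & & \vdots & \vdots \\ \xi_{n-1} & \ldots & \psi_{n-1} & \omega_{n-1} \\ 0 & \ldots & 0 & \omega_n \end{bmatrix}, \qquad \omega_n \neq 0, \qquad \Delta = \begin{vmatrix} \xi_1 & \ldots & \psi_1 \\ \vdots & & \vdots \\ \xi_{n-1} & \ldots & \psi_{n-1} \end{vmatrix} \neq 0, \]

with $n - 1$ zeros in the $n$th row defines a group of transformations.
For the product of two such still has the requisite
zeros; and each group property, including the existence of an
identical member, is secured. Here then is a subgroup of the
general transformation group. Since $M_1$ has $n^2 - n + 1$ arbitrary
elements, this number gives the dimensions of the affine group
for $(n - 1)$-fold space.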
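
The closure of the affine type under multiplication may be seen at once in a particular case. The sketch below uses two assumed three-rowed integer matrices with zeros in the first two places of the last row; the function names are hypothetical.

```python
def matmul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]


def is_affine_type(P):
    """Last row has n - 1 zeros followed by a non-zero element."""
    last = P[-1]
    return all(v == 0 for v in last[:-1]) and last[-1] != 0


def affine_dimensions(n):
    """Number of arbitrary elements: n^2 less the n - 1 prescribed zeros."""
    return n * n - (n - 1)


# Two assumed matrices of the affine type (n = 3).
M1 = [[2, 1, 5],
      [1, 3, -2],
      [0, 0, 4]]
N1 = [[1, -1, 0],
      [2, 1, 7],
      [0, 0, -3]]

product = matmul(M1, N1)

print(is_affine_type(M1), is_affine_type(N1), is_affine_type(product))
print(affine_dimensions(3))
```

The last diagonal element of the product is the product of the two last diagonal elements, so it cannot vanish when neither factor's does.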


The Affine Group with a Fixed Point. 

This is defined by

\[ M_2 = \begin{bmatrix} \xi_1 & \ldots & \psi_1 & 0 \\ \vdots & & \vdots & \vdots \\ \xi_{n-1} & \ldots & \psi_{n-1} & 0 \\ 0 & \ldots & 0 & \omega_n \end{bmatrix}, \qquad \omega_n \neq 0, \qquad \Delta \neq 0, \]

with $(n - 1)^2 + 1$ group dimensions. Here there are $n - 1$
zeros in both the last row and column. Again the group
properties hold.

The Orthogonal Affine Group with a Fixed Point.

The matrix $M_2$ whose minor $\Delta$
is orthogonal defines a group, as is easily verified. The $\tfrac{1}{2}n(n - 1)$
conditions needed to make this minor orthogonal cut down the
group dimensions to

\[ (n - 1)^2 + 1 - \tfrac{1}{2}n(n - 1) = \tfrac{1}{2}(n^2 - 3n + 4). \]

EXAMPLES 

1. The affine group leaves the equation of a certain prime (linear form)
absolute; namely, $x_n = 0$ becomes $x_n' = 0$ when $x \to x'$.

If $n = 3$, this is illustrated by taking $x_1/x_3$, $x_2/x_3$ as Cartesian oblique
co-ordinates and regarding the transformation as a change of axes.

2. Keeping the same axes and regarding the transformation as a change
of figure, the point $x$ moving to $x'$, prove that the affine group ($n = 3$)
changes point to point, line to line, and parallel lines to parallel lines.

3. The $M_2$ group can be regarded as a change of axes without shifting
the origin.



8. Induced Compound Transformations. 


In §8, p. 86, certain compound co-ordinate sets $\pi_2, \pi_3, \ldots$,
$p_{n-1}, p_{n-2}, \ldots$ were introduced as a direct application of the
determinant theory. We now proceed to develop their properties
in relation to the Sylvester theory of cogredience.

Let ic, y, z, . . . , s, ^ be any number h of cogredient variables, 
r of which we choose to form a matrix of r rows and n columns, 
say 

r 2^1 ^ » 1 




2/1 2/2 



This has determinants of order r, making a set which we call 
provided ran. 

Definition. — The t-rowed determinants of the set are called 
xth compound point co-ordinates. 



Accordingly the sets $\pi_2, \pi_3, \ldots$ are particular second, third, ... compound point co-ordinates. If we abbreviate the set $\pi_2 = (xy)_{\alpha\beta}$ as $xy$, and $\pi_3$ as $xyz$, and so on, we consider all the various sets

$$xy,\ xz,\ \ldots,\ xt,\ yz,\ \ldots,\ st$$

as second compounds: and similar remarks apply to $r$th compounds. Thus from $k$ given points $x, y, \ldots, t$ we derive $\binom{k}{r}$ $r$th compounds.

In particular, if $r = n-1$ the $r$th compound is a prime (§8, p. 86). This is true of any $(n-1)$th compound. Also if $r = 1$ we revert to the original point type $x$ or $y$ ... or $t$. And once more, if $r = n$, the matrix is square and has a single determinant. This gives a set of $n$ points which form a simplex, provided the determinant does not vanish: otherwise the points are linearly related.

We now come to an important theorem. 


Theorem. A linear transformation $T$ of cogredient variables $x, y, \ldots$ to $x', y', \ldots$ respectively, induces a linear transformation upon all their compounds $xy, xyz, \ldots$, such that all $r$th compounds are cogredient.


Proof.

This follows immediately from the theorem of corresponding matrices (§4, p. 79). For by (4), p. 148, which we write shortly as $x_1' = \xi_x$, $x_2' = \eta_x$, &c., where $\xi_x = \xi_1 x_1 + \xi_2 x_2 + \ldots + \xi_n x_n$, we have for cogredient variables

$$y_1' = \xi_y, \quad y_2' = \eta_y, \quad \ldots, \quad y_n' = \omega_y.$$

Hence

$$(x'y')_{12} = \begin{vmatrix} \xi_x & \eta_x \\ \xi_y & \eta_y \end{vmatrix} = (\xi\eta \mid xy) = \sum (\xi\eta)_{ij}\,(xy)_{ij},$$

which shows that the new compound co-ordinate $(x'y')_{12}$ is a linear function of all the old $(xy)_{ij}$. Similarly, if $\theta, \phi$ belong to the $\alpha$th and $\beta$th rows of (4),

$$(x'y')_{\alpha\beta} = (\theta\phi \mid xy) = \sum (\theta\phi)_{ij}\,(xy)_{ij}.$$

Hence the second compound $x'y'$ is transformed linearly to $xy$ by a matrix whose elements are the second compounds of the elements of $M$, which transforms $x'$ to $x$ and $y'$ to $y$. The same matrix arises whatever pair among $x, y, z, \ldots, t$ is first selected. So all second compounds are cogredient.
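The statement just proved can be checked numerically. The following is a minimal sketch, not taken from the text: the helper names `det`, `compound` and `apply`, the sample matrix and the sample points are all illustrative assumptions, and the transformation is taken in the form $x' = Mx$ with the minors listed in lexicographic order.

```python
from itertools import combinations

def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def compound(m, r):
    """All r-rowed minors of m, rows and columns taken in lexicographic order."""
    rows = list(combinations(range(len(m)), r))
    cols = list(combinations(range(len(m[0])), r))
    return [[det([[m[i][j] for j in c] for i in rw]) for c in cols] for rw in rows]

def apply(m, v):
    """Matrix times column vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

M = [[2, 1, 0], [1, 3, 1], [0, 1, 4]]   # hypothetical non-singular matrix, x' = M x
x, y = [1, 2, 3], [0, 1, 5]              # two cogredient points
xp, yp = apply(M, x), apply(M, y)        # the transformed points x', y'

old = compound([x, y], 2)[0]             # second compound co-ordinates (xy)_ij
new = compound([xp, yp], 2)[0]           # second compound co-ordinates (x'y')_ij
induced = apply(compound(M, 2), old)     # second compound of M acting on (xy)
```

Here `new` and `induced` agree, and the same matrix `compound(M, 2)` would serve for any other pair of points, which is the cogredience asserted by the theorem.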

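Likewise for third compounds

$$(x'y'z')_{\alpha\beta\gamma} = (\theta\phi\psi \mid xyz) = \sum (\theta\phi\psi)_{ijk}\,(xyz)_{ijk},$$

leading to a similar result; and so on until the $(n-1)$th compound is reached, in which case the typical equation is

$$(x'y'z' \ldots s')_{23 \ldots n} = \sum (\eta\zeta \ldots \omega)_{ij\ldots}\,(xyz \ldots s)_{ij\ldots}.$$

The coefficients in this series are proportional to the co-factors of $\xi_1, \xi_2, \ldots, \xi_n$ in $|M|$, which leads back to the result that the transformation of $(n-1)$th compounds is cogredient with that of $u$ and contragredient to $x$.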

This puts the Sylvester-Cauchy theorem (§9, p. 87) on compound determinants in a new light. For we have now arrived at a system of compound matrices, say $M, M_2, M_3, \ldots$, whose determinants $|M|, |M_2|, \ldots$ are what have been called compound determinants. Since each determinant, according to this theorem, is a power of $|M|$, it follows that none of these compound linear transformations are singular unless that of $x$ itself is.

Corollary I. The correlative compounds $p_r$ undergo linear transformation, such that $p_r$ and $\pi_r$ are contragredient.

This follows at once from §8, p. 86.

Corollary II. If the transformation $T : x \to x'$ is orthogonal, so also is each compound transformation.

For if $\alpha, \beta, \gamma, \ldots$ denote any of the rows $\xi, \eta, \ldots, \omega$ in the transformation matrix $M$, then

$$(\alpha \mid \alpha) = 1, \qquad (\alpha \mid \beta) = 0, \quad \alpha \neq \beta,$$

when $M$ is orthogonal: whence, by the theorem of corresponding matrices with $r$ letters both before and after the vertical line,

$$(\alpha\beta \ldots \mid \gamma\delta \ldots) = 1 \ \text{or} \ 0$$

according as $\alpha = \gamma$, $\beta = \delta$, ... or at least one of $\gamma, \delta, \ldots$ differs from $\alpha, \beta, \ldots$. These conditions at once imply that the $r$th compound is orthogonal.
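Both the remark on compound determinants and Corollary II lend themselves to a quick numerical check. The sketch below is illustrative only: the helper names and the two sample matrices (one general, one orthogonal with rational entries) are assumptions chosen for the purpose, and for $n = 3$ the second compound determinant is taken to equal $|M|^2$.

```python
from fractions import Fraction
from itertools import combinations

def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def compound(m, r):
    """Matrix of all r-rowed minors of m in lexicographic order."""
    rows = list(combinations(range(len(m)), r))
    cols = list(combinations(range(len(m[0])), r))
    return [[det([[m[i][j] for j in c] for i in rw]) for c in cols] for rw in rows]

def times_transpose(m):
    """The product of m with its own transpose."""
    return [[sum(a * b for a, b in zip(r1, r2)) for r2 in m] for r1 in m]

M = [[2, 1, 0], [1, 3, 1], [0, 1, 4]]              # hypothetical general matrix
third = Fraction(1, 3)
Q = [[third * v for v in row]                      # hypothetical orthogonal matrix
     for row in ([1, 2, 2], [2, 1, -2], [2, -2, 1])]
identity3 = [[1 if i == j else 0 for j in range(3)] for i in range(3)]

M2, Q2 = compound(M, 2), compound(Q, 2)            # second compound matrices
```

Exact rational arithmetic is used so that the orthogonality conditions can be compared with the unit matrix without rounding.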





9. Connexion between Matrices and Quaternions. 

The theory of four-rowed orthogonal matrices is intimately connected with that of quaternions. If we introduce into non-commutative algebra three elements $i, j, k$ called complex units, defined solely by the equations

$$i^2 = j^2 = k^2 = -1, \quad jk = -kj = i, \quad ki = -ik = j, \quad ij = -ji = k, \qquad (1)$$

then a quaternion is a linear function of $i, j, k$

$$q = ix + jy + kz + t,$$

where $x, y, z, t$ are scalar.

If $x, y, z, t$ belong to the field of real numbers, $q$ is called a real quaternion; if to the field of complex numbers $(\alpha + \sqrt{-1}\,\beta)$, $q$ is a complex quaternion.

The quaternion

$$q' = -ix - jy - kz + t$$

is called the conjugate of $q$; it satisfies the scalar condition

$$qq' = q'q = x^2 + y^2 + z^2 + t^2,$$

analogous to the property of conjugate complex numbers $\alpha + \sqrt{-1}\,\beta$ and $\alpha - \sqrt{-1}\,\beta$. This quadratic expression $x^2 + y^2 + z^2 + t^2$ is called the norm of $q$.
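The rules for the units can be exercised with a few lines of code. This is a minimal sketch under stated assumptions: a quaternion $ix + jy + kz + t$ is stored as the tuple `(x, y, z, t)`, the units multiply by the usual rules $ij = k$, $jk = i$, $ki = j$, and the function names `qmul`, `conj` and `norm` are hypothetical.

```python
def qmul(p, q):
    """Product pq of quaternions stored as (x, y, z, t), meaning ix + jy + kz + t."""
    a, b, c, d = p
    x, y, z, t = q
    return (d * x - c * y + b * z + a * t,     # coefficient of i
            c * x + d * y - a * z + b * t,     # coefficient of j
            -b * x + a * y + d * z + c * t,    # coefficient of k
            -a * x - b * y - c * z + d * t)    # scalar part

def conj(q):
    """Conjugate: the signs of the i, j, k parts are reversed."""
    x, y, z, t = q
    return (-x, -y, -z, t)

def norm(q):
    """Sum of the squares of the four components."""
    return sum(v * v for v in q)

i, j, k = (1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0)
p, q = (1, 2, 3, 4), (5, -6, 7, 8)             # arbitrary sample quaternions
r = qmul(p, q)
```

With these definitions `qmul(q, conj(q))` has vanishing $i, j, k$ parts and scalar part `norm(q)`, and the norm of `r` is the product of the norms of `p` and `q`.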


EXAMPLES 


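1. Prove that the two-row matrices

$$i = \begin{bmatrix} \iota & 0 \\ 0 & -\iota \end{bmatrix}, \quad j = \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix}, \quad k = \begin{bmatrix} 0 & \iota \\ \iota & 0 \end{bmatrix}$$

satisfy the above properties (1) and verify that $i, j, k$ as defined by (1) also satisfy the associative laws.

2. A quaternion $q$ is expressible as a two-rowed matrix

$$q = \begin{bmatrix} t + \iota x & y + \iota z \\ -y + \iota z & t - \iota x \end{bmatrix},$$

where $\iota$ denotes $\sqrt{-1}$.

3. Prove that the matrix product $qq'$ is scalar, where $q'$ is the conjugate of $q$.

4. If $z, \bar z$ are conjugate complex numbers, and also $w, \bar w$, then

$$\begin{bmatrix} z & w \\ -\bar w & \bar z \end{bmatrix}$$

is a quaternion.

Prove the reversal law for the conjugate of a product of quaternions $p, q$, namely

$$(pq)' = q'p'.$$

5. The norm of a product is the product of the norms. This generalizes the well-known theorem for the product of moduli of complex numbers.

Justify the steps in the following proof:

If $r = pq$, then $r' = q'p'$. So

$$rr' = pqq'p' = p(qq')p' = (pp')(qq').$$

6. Taking $p = i\alpha + j\beta + k\gamma + \delta$, $q = ix + jy + kz + t$, express $r = pq$ in full as a quaternion. Hence by 5, prove the identity

$$(\alpha^2 + \beta^2 + \gamma^2 + \delta^2)(x^2 + y^2 + z^2 + t^2) = X^2 + Y^2 + Z^2 + T^2,$$

where

$$\begin{aligned}
X &= \delta x - \gamma y + \beta z + \alpha t, \\
Y &= \gamma x + \delta y - \alpha z + \beta t, \\
Z &= -\beta x + \alpha y + \delta z + \gamma t, \\
T &= -\alpha x - \beta y - \gamma z + \delta t.
\end{aligned}$$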


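7. If $x^2 + y^2 + z^2 + t^2 = 1 = \alpha^2 + \beta^2 + \gamma^2 + \delta^2$, prove that

$$A = \begin{bmatrix} \delta & -\gamma & \beta & \alpha \\ \gamma & \delta & -\alpha & \beta \\ -\beta & \alpha & \delta & \gamma \\ -\alpha & -\beta & -\gamma & \delta \end{bmatrix} \quad \text{and} \quad B = \begin{bmatrix} t & z & -y & x \\ -z & t & x & y \\ y & -x & t & z \\ -x & -y & -z & t \end{bmatrix}$$

are orthogonal matrices.

8. If

$$\begin{bmatrix} X_1 & X_2 & X_3 & X_4 \\ Y_1 & Y_2 & Y_3 & Y_4 \\ Z_1 & Z_2 & Z_3 & Z_4 \\ T_1 & T_2 & T_3 & T_4 \end{bmatrix}$$

denotes the product $AB$, prove that the sum of the squares of elements in each column is

$$(\alpha^2 + \beta^2 + \gamma^2 + \delta^2)(x^2 + y^2 + z^2 + t^2).$$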



CHAPTER X


General Properties of Invariants

1. Linear Transformation of the General Form of Order p.

Certain theorems apply equally well to forms in $n$ variables $x_1, x_2, \ldots, x_n$ as to binary forms; so we shall now consider them.

Let

$$f = \sum c_i\, x_1^{\alpha_i} x_2^{\beta_i} \ldots x_n^{\rho_i} \qquad (1)$$

be a form of order $p$ so that

$$\alpha_i + \beta_i + \ldots + \rho_i = p \qquad (2)$$

for each term of $f$. We replace $c_i$ by a multinomial coefficient together with an arbitrary coefficient $a_i$ such that

$$c_i = \frac{p!}{\alpha_i!\,\beta_i! \ldots \rho_i!}\, a_i. \qquad (3)$$

Let $N$ be the number of different terms in the general $p$-ic $f$, that is the number of different values of the matrix

$$[\alpha_i, \beta_i, \ldots, \rho_i]. \qquad (4)$$

Then $N$ is also the number of different terms in the special form when $a_1 = a_2 = \ldots = a_N = 1$, namely

$$(x_1 + x_2 + \ldots + x_n)^p. \qquad (5)$$

With this understanding we write

$$f = (a_1, a_2, \ldots, a_N \between x_1, x_2, \ldots, x_n)^p = f(x) \qquad (6)$$

in the contracted functional notation.

Now let $T : x \to x'$ be a linear transformation with $n$ equations

$$x_i = \xi_i x_1' + \eta_i x_2' + \ldots + \omega_i x_n', \qquad i = 1, 2, \ldots, n, \qquad (7)$$

where the square matrix $M$ of the $n^2$ coefficients $\xi_1, \ldots, \omega_n$ has a non-zero determinant $|M|$. Subject to this sole condition, the coefficients are arbitrary independent real or complex numbers.

As in binary forms the effect of this transformation upon the $p$-ic $f(x)$ is to produce a new $p$-ic $f'(x')$. Thus

$$\begin{aligned}
f = f(x) = f'(x') &= (a_1, a_2, \ldots, a_N \between x_1, x_2, \ldots, x_n)^p \\
&= (a_1', a_2', \ldots, a_N' \between x_1', x_2', \ldots, x_n')^p, \qquad (8)
\end{aligned}$$

defining a set of $N$ new coefficients $[a']$ analogous to (3). In fact $a_i'$ is the coefficient of $x_1'^{\alpha_i} x_2'^{\beta_i} \ldots x_n'^{\rho_i}$ after removing the multinomial factor. But this is found on the left-hand side by picking out the required terms in the expansion of each separate term. For our present purpose it is sufficient to observe that each $a_i'$ is a linear function of $a_1, a_2, \ldots, a_N$. We typify this by

$$T : x \to x', \quad a \to a'.$$

2. Projective Invariants. 

Definition of Invariant. A polynomial function $I(a, b, \ldots)$ of the coefficients $a, b, \ldots$ of forms $f(x), g(x), \ldots$ is a polynomial projective invariant if

$$I(a', b', \ldots) = \phi(\xi)\, I(a, b, \ldots) \qquad (9)$$

identically, where $\phi(\xi)$ is a factor depending solely on the $n^2$ coefficients $\xi_1, \ldots, \omega_n$ of the transformation $x \to x'$.

Such a function is a relative projective invariant, to give it its full accepted title, but briefly it is called an invariant. We prove a few theorems which hold of such functions $I$.

Theorem I. The factor $\phi(\xi)$ is a positive integral power of the modulus $|M|$ of the transformation $x \to x'$; namely

$$\phi(\xi) = |M|^w. \qquad (10)$$

Proof.

For if $I(a)$ denote such an invariant of $f(x)$ then

$$I(a') = \phi(\xi)\, I(a).$$

But suppose we start with $f'(x')$, then $I(a')$ is an invariant of $f'(x')$. If we now transform $f'$ back to $f$ by the inverse transformation $x' \to x$ (cf. (5), p. 69),

$$x_1' = \frac{\Xi_1}{|M|} x_1 + \frac{\Xi_2}{|M|} x_2 + \ldots + \frac{\Xi_n}{|M|} x_n, \quad \text{&c.},$$

where $\Xi_i$ denotes the co-factor of $\xi_i$ in $|M|$, and the condition analogous to (9) is

$$I(a) = \phi\!\left(\frac{\Xi}{|M|}\right) I(a').$$

Hence if $I(a) \neq 0$, we obtain by multiplying these results

$$\phi(\xi)\, \phi\!\left(\frac{\Xi}{|M|}\right) = 1. \qquad (11)$$

Hence we can clear the denominator of (11) on multiplying through by a suitable power $|M|^s$ and, after expressing each co-factor $\Xi$ of $|M|$ in terms of the elements $\xi$, we obtain

$$\phi(\xi)\, \psi(\xi) = |M|^s, \qquad (12)$$

where both $\phi$ and $\psi$ are polynomials in their arguments. But $|M|$ is an arbitrary determinant and therefore has no factors rational and integral in its elements. Consequently both $\phi$ and $\psi$ are powers of $|M|$. Thus $\phi(\xi) = |M|^w = \Delta^w$.

This index $w$ is called the weight of the invariant $I$.

Corollary. If $I_1, I_2$ are two invariants of weight $w_1, w_2$ respectively, then

$$I_1(a')\, I_2(a') = \Delta^{w_1 + w_2}\, I_1(a)\, I_2(a).$$

Hence the product is an invariant of weight $w_1 + w_2$. The quotient $I_1/I_2$ satisfies the condition of invariancy, and is called a rational invariant of weight $w_1 - w_2$.

An algebraic invariant is the root of an equation

$$I_0 I^r + I_1 I^{r-1} + \ldots + I_r = 0,$$

where each coefficient is a rational integral invariant.

The sum of two invariants is only invariant if their weights are equal. For

$$I_1(a') + I_2(a') = \Delta^{w_1} I_1(a) + \Delta^{w_2} I_2(a),$$

and if the left-hand side is invariant it has just been proved equal to $\Delta^w (I_1 + I_2)$. Hence $w_1 = w_2$.

We sum this up by saying an invariant is isobaric.

EXAMPLE 

Show that the present definition of weight w agrees in the binary 
case with the definition already introduced in §4, p. 134. 

3. Homogeneity of Invariants. 

Consider a number of given ground forms $f(x), g(x), \ldots$ whose coefficient sets are $[a], [b], \ldots$. Let $I$ be a simultaneous invariant of weight $w$, so that

$$I(a', b', \ldots) = \Delta^w\, I(a, b, \ldots).$$

We shall prove that it may be sorted out uniquely into a number of terms homogeneous in each set $[a], [b], \ldots$, each such term being an invariant.

Theorem II. Every simultaneous invariant can be expressed in one and only one way as a sum

$$I = I' + I'' + \ldots + I^{(r)}$$

of invariants $I^{(i)}$ which are each of weight $w$ and homogeneous in each set of coefficients involved.

Proof.

First let all terms be reduced as far as possible, terms with the same index set, §1 (4), being collected into one term. If $I$ is not homogeneous in each set $[a], [b], \ldots$ let it be written

$$I = I_1(a) + I_2(a) + \ldots + I_r(a),$$

where each term in this sum is homogeneous in each set.

Now by definition we have

$$I(a') = \phi \times \{I_1(a) + I_2(a) + \ldots + I_r(a)\}.$$

Therefore

$$I_1(a') + I_2(a') + \ldots + I_r(a') = \phi \times \{I_1(a) + I_2(a) + \ldots + I_r(a)\}$$

identically. Also $\phi$ is independent of $a, b, \ldots$, while $a'$ is linear in $a$; $b'$ in $b$; .... Hence the only part on the left-hand side which is of the same degree in $a$ as $I_1(a)$ on the right is $I_1(a')$, so that

$$I_1(a') = \phi\, I_1(a).$$

Thus $I_1(a)$ is an invariant.

For example

$$a_0 a_2 - a_1^2 + a_0 b_2 - 2a_1 b_1 + a_2 b_0$$

is a simultaneous invariant of the two binary quadratics

$$(a_0, a_1, a_2 \between x_1, x_2)^2 \quad \text{and} \quad (b_0, b_1, b_2 \between x_1, x_2)^2,$$

but it is the sum of the two expressions

$$a_0 a_2 - a_1^2 \quad \text{and} \quad a_0 b_2 - 2a_1 b_1 + a_2 b_0,$$

each of which is homogeneous in the two sets of coefficients. Calling these invariants $I_1$ and $I_2$, they both have weight two, but they differ in degree. The degree of an invariant of a single form is its degree in the coefficients of the form. So $I_1$ has degree two and weight two. In keeping with this definition, $I_2$ is said to have partial degrees (1, 1) in the respective sets of coefficients, and again its weight is two.
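The weights claimed for these two expressions can be confirmed by direct substitution. The following sketch is illustrative rather than part of the text: the function names, the sample coefficients and the sample transformation are assumptions, and the substitution is taken as $x_1 = \xi_1 x_1' + \eta_1 x_2'$, $x_2 = \xi_2 x_1' + \eta_2 x_2'$ with modulus $\Delta = \xi_1\eta_2 - \xi_2\eta_1$.

```python
def transform_quadratic(a, xi, eta):
    """New coefficients (a0', a1', a2') of a0*x1^2 + 2*a1*x1*x2 + a2*x2^2
    after x1 = xi[0]*x1' + eta[0]*x2', x2 = xi[1]*x1' + eta[1]*x2'."""
    a0, a1, a2 = a
    return (a0 * xi[0] ** 2 + 2 * a1 * xi[0] * xi[1] + a2 * xi[1] ** 2,
            a0 * xi[0] * eta[0] + a1 * (xi[0] * eta[1] + xi[1] * eta[0])
            + a2 * xi[1] * eta[1],
            a0 * eta[0] ** 2 + 2 * a1 * eta[0] * eta[1] + a2 * eta[1] ** 2)

def I1(a):
    """a0*a2 - a1^2, of degree two in the single set [a]."""
    return a[0] * a[2] - a[1] ** 2

def I2(a, b):
    """a0*b2 - 2*a1*b1 + a2*b0, of partial degrees (1, 1)."""
    return a[0] * b[2] - 2 * a[1] * b[1] + a[2] * b[0]

xi, eta = (2, 1), (1, 3)                       # hypothetical transformation
delta = xi[0] * eta[1] - xi[1] * eta[0]        # its modulus
a, b = (1, 2, 5), (3, -1, 4)                   # hypothetical ground coefficients
a_new = transform_quadratic(a, xi, eta)
b_new = transform_quadratic(b, xi, eta)
```

Each of `I1` and `I2` is reproduced multiplied by `delta ** 2`, so that each is separately an invariant of weight two, as is their sum.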

4. Ground Forms. 

Definition . — The form or forms which give rise to invariants 
are called ground forms. The coefficients of terms in the forms are 
ground coefficients. 

It should now be clear that three essential things are involved 
in the invariant theory: the ground form, the transformation, 
and the invariant. In its general aspect the problem before us
is to discover whether a function, say I(a), exists, and if so how
many such functions exist. To these questions a general answer
can be given, not unlike the corresponding answer to the question
whether a given equation, algebraic or differential, has a solution.
The results are crystallized in the great theorems which follow 
later, associated with the names of Clebsch, Gordan, and 
Hilbert. 





5. Symbolic Notation. 


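The reader is already familiar with differential operators which combine very like ordinary numbers. Suppose, for example, that $x_1, x_2$ are independent of $x$ and $y$. Then we may write

$$\left(x_1 \frac{\partial}{\partial x} + x_2 \frac{\partial}{\partial y}\right)^p f = x_1^p \frac{\partial^p f}{\partial x^p} + p\, x_1^{p-1} x_2 \frac{\partial^p f}{\partial x^{p-1} \partial y} + \ldots + x_2^p \frac{\partial^p f}{\partial y^p} \qquad (13)$$

identically, provided $p$ is a positive integer and $f$ a function capable of such successive differentiation.

In particular let

$$f = a_0 x^p + p\, a_1 x^{p-1} y + \ldots + a_p y^p; \qquad (14)$$

then each $p$th derivate of $f$ is a single term, and, in fact,

$$\frac{\partial^p f}{\partial x^{p-r} \partial y^r} = p!\, a_r. \qquad (15)$$

Hence identity (13) now takes the form

$$\left(x_1 \frac{\partial}{\partial x} + x_2 \frac{\partial}{\partial y}\right)^p f = p!\,(a_0 x_1^p + p\, a_1 x_1^{p-1} x_2 + \ldots + a_p x_2^p). \qquad (16)$$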

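Let us now introduce the following notation:

$$\alpha_1^{p-r} \alpha_2^r = \frac{1}{p!} \frac{\partial^p f}{\partial x^{p-r} \partial y^r}, \qquad r = 0, 1, \ldots, p, \qquad (17)$$

so that for extreme values of $r$,

$$\alpha_1^p = \frac{1}{p!} \frac{\partial^p f}{\partial x^p}, \qquad \alpha_2^p = \frac{1}{p!} \frac{\partial^p f}{\partial y^p}.$$

Then owing to the convenient fact that the differential operators $\dfrac{\partial}{\partial x}$, $\dfrac{\partial}{\partial y}$ combine as numbers and obey the index law, so also do $\alpha_1, \alpha_2$. This becomes plainer if we take an actual example, say $\alpha_1^3 \alpha_2^2$, which would occur if $p = 5$. This would be written as

$$\alpha_1^3 \alpha_2^2 = \frac{1}{5!} \frac{\partial^5 f}{\partial x^3 \partial y^2}.$$

Manifestly we should have such relations as

$$\alpha_1^3 \alpha_2^2 = \alpha_1 \alpha_2\, \alpha_1^2 \alpha_2 = \alpha_2^2 \alpha_1^3 = \text{&c.};$$

for they are all ways of writing the same quantity

$$\frac{1}{5!} \frac{\partial^5 f}{\partial x^3 \partial y^2}.$$

So we have introduced two symbols $\alpha_1, \alpha_2$ which have the properties of ordinary numbers with this proviso: they only occur in a product of degree $p$, involving $p - r$ factors $\alpha_1$ and $r$ factors $\alpha_2$: otherwise they are undefined.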

Now let us extend this definition. Let $\alpha_1$ and $\alpha_2$ occur in a product of degree 1, 2, 3, ..., or $p$, and behave like ordinary numbers, but let a product of degree greater than $p$ be undefined and therefore meaningless. There is clearly no contradiction involved in such a restriction.

If we substitute from (17) in (15) we obtain the elegant result

$$\alpha_1^p = a_0, \quad \alpha_1^{p-1} \alpha_2 = a_1, \quad \ldots, \quad \alpha_1^{p-r} \alpha_2^r = a_r, \quad \ldots, \quad \alpha_2^p = a_p. \qquad (18)$$

The result when put in (16) gives

$$\frac{1}{p!}\left(x_1 \frac{\partial}{\partial x} + x_2 \frac{\partial}{\partial y}\right)^p f = a_0 x_1^p + \ldots + a_p x_2^p. \qquad (19)$$

Since $x_1, x_2$ are independent of $x$ and $y$, they combine with the differential operators and therefore with $\alpha_1$ and $\alpha_2$ as with ordinary numbers. So we may write

$$(\alpha_1 x_1 + \alpha_2 x_2)^p = a_0 x_1^p + p\, a_1 x_1^{p-1} x_2 + \ldots + a_p x_2^p. \qquad (20)$$

This is an identity for $x_1, x_2$, obviously agreeing with relations (18).

These $\alpha_1, \alpha_2$ are the Clebsch-Aronhold symbols, founded on the hyperdeterminants of Cayley, which have proved to be of the utmost value in developing the general theorems of the invariant theory. Now that they have been defined we can dispense with all that precedes (18) and (20) by marking the following doctrine of these symbols:

The symbols $\alpha_1, \alpha_2$ behave as ordinary numbers. They have no actual meaning as numbers except when they occur in a product involving exactly $p$ of them.

Thus the symbols express the coefficients $a_0, \ldots, a_p$ of the binary $p$-ic $f$ explicitly and uniquely, and any linear function of the coefficients can be written unambiguously by means of the symbols. Indeed they express the binary $p$-ic itself as a perfect $p$th power of a symbolic linear form $\alpha_1 x_1 + \alpha_2 x_2$.

If in particular the binary $p$-ic happens to be a perfect $p$th power, the symbols represent actual numbers. This is called the scalar instance of the general symbolic form.
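As an illustration of this doctrine, which is not in the original text, take the binary cubic, $p = 3$. Expanding the symbolic cube as though $\alpha_1, \alpha_2$ were ordinary numbers, and then replacing each product of exactly three symbols by the coefficient it stands for, gives the following worked instance.

```latex
\begin{aligned}
(\alpha_1 x_1 + \alpha_2 x_2)^3
  &= \alpha_1^3\, x_1^3 + 3\,\alpha_1^2\alpha_2\, x_1^2 x_2
     + 3\,\alpha_1\alpha_2^2\, x_1 x_2^2 + \alpha_2^3\, x_2^3 \\
  &= a_0 x_1^3 + 3 a_1 x_1^2 x_2 + 3 a_2 x_1 x_2^2 + a_3 x_2^3,
\end{aligned}
\qquad
\alpha_1^3 = a_0,\quad \alpha_1^2\alpha_2 = a_1,\quad
\alpha_1\alpha_2^2 = a_2,\quad \alpha_2^3 = a_3 .
```

A product such as $\alpha_1 \alpha_2$ alone, containing only two symbols, has no actual meaning for the cubic.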


6. Symbols for Forms in Three or More Variables. 


Exactly the same methods may be used to denote a homogeneous form in three or more variables, $x_1, x_2, x_3, \ldots$ by means of symbols $\alpha_1, \alpha_2, \alpha_3, \ldots$.

From an identity analogous to (20), we should arrive at the result

$$(\alpha_1 x_1 + \alpha_2 x_2 + \alpha_3 x_3)^p = \sum \frac{p!}{i!\,j!\,k!}\, a_{ijk}\, x_1^i x_2^j x_3^k, \qquad (21)$$

where $i + j + k = p$, the summation extending to all different values of the index matrix $[i, j, k]$. The only essential difference here is in choosing a suitable notation for the coefficients of the ternary $p$-ic $f$, which now takes the place of (14). Whatever principle of suffix or other notation is adopted on the right-hand side of (21), it is agreed that a product of $p$ symbols

$$\alpha_1^i \alpha_2^j \alpha_3^k,$$

multiplied by the trinomial coefficient $\dfrac{p!}{i!\,j!\,k!}$, actually represents the coefficient of $x_1^i x_2^j x_3^k$ in this ternary $p$-ic.

If $p = 2$, the coefficients are best denoted by double suffixes in all cases involving many variables. Thus

$$f = \sum_i \sum_j a_{ij}\, x_i x_j, \qquad i = 1, 2, \ldots, n, \quad j = 1, 2, \ldots, n, \quad a_{ij} = a_{ji},$$

is the quadratic form in $n$ homogeneous variables.



For instance, in this notation the areal or homogeneous equation of a conic is

$$f = a_{11} x_1^2 + a_{22} x_2^2 + a_{33} x_3^2 + 2a_{23} x_2 x_3 + 2a_{31} x_3 x_1 + 2a_{12} x_1 x_2 = 0,$$

which is symbolically written

$$(\alpha_1 x_1 + \alpha_2 x_2 + \alpha_3 x_3)^2.$$

This leads to the very simple definition of symbols for a quadratic, namely

$$\alpha_i \alpha_j = a_{ij}.$$

So also, in accordance with the defined behaviour of the symbols,

$$\alpha_j \alpha_i = a_{ji} = a_{ij}.$$

Similarly the quaternary quadratic is denoted by

$$(\alpha_1 x_1 + \alpha_2 x_2 + \alpha_3 x_3 + \alpha_4 x_4)^2,$$

and the general quadratic by

$$(\alpha_1 x_1 + \alpha_2 x_2 + \ldots + \alpha_n x_n)^2,$$

where as before

$$\alpha_i \alpha_j = a_{ij}.$$

Cubic Forms. Next if $p = 3$, a triple suffix notation is convenient, so that the general cubic is symbolized by

$$(\alpha_1 x_1 + \alpha_2 x_2 + \ldots + \alpha_n x_n)^3,$$

where $\alpha_i \alpha_j \alpha_k = a_{ijk}$ gives the coefficient of $x_i x_j x_k$ apart from the multinomial coefficient, in this case 1, 3, or 6 according as $i = j = k$, or two only are equal, or all differ.

The General Form of Order p. This is best denoted by attaching a group of $p$ suffixes for a coefficient. Thus

$$f = \sum_i \ldots \sum_m a_{ijk \ldots m}\, x_i x_j x_k \ldots x_m,$$

the summation extending from 1 to $n$ for each of the $p$ suffixes $i, \ldots, m$. This will give all possible products $x_i \ldots x_m$ of degree $p$ in the set $\{x_1, \ldots, x_n\}$; also each term obtained by deranging the suffixes of a given term will be of the same kind. We may therefore define the coefficients given by all different permutations of the suffixes to be equal. This allows, as a simple consequence, the symbolic form

$$(\alpha_1 x_1 + \alpha_2 x_2 + \ldots + \alpha_n x_n)^p$$

to represent $f$; so that, by equating coefficients,

$$\alpha_i \alpha_j \ldots \alpha_m = a_{ijk \ldots m}.$$

And lastly, if we carefully distinguish between single suffixes and multiple suffixes, we may with advantage use the letter $a$ rather than $\alpha$ for the symbol, and write

$$a_i a_j \ldots a_m = a_{ijk \ldots m}$$

for the typical coefficient of the $p$-ic $f$, now symbolized by

$$(a_1 x_1 + a_2 x_2 + \ldots + a_n x_n)^p.$$

Here the actual coefficient $a_{ijk \ldots m}$ is said to be resolved into its symbolic factors $a_i, \ldots, a_m$: and conversely, a symbolic product is only an actual coefficient if it contain exactly $p$ factors $a_i, \ldots$. In particular if $p = 1$, or if $f$ itself is a perfect $p$th power, this distinction breaks down, and the form provides its own symbol. Such is called by Professor Bell the scalar instance of the symbolic expression.

For binary forms, this notation is unsuitable, since single suffixes do duty for the actual coefficients.


7. Polar Forms. 

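First we contract the notation $\alpha_1 x_1 + \alpha_2 x_2$ of the binary case to $\alpha_x$, so that the binary $p$-ic is denoted by $\alpha_x^p$. Thus

$$\begin{aligned}
\alpha_x^p &= (a_0, a_1, \ldots, a_p \between x_1, x_2)^p \qquad (22) \\
&= a_0 x_1^p + p\, a_1 x_1^{p-1} x_2 + \ldots + a_p x_2^p.
\end{aligned}$$

Then an excellent example of the use of these symbols is in the polar process.

Since $\alpha_1, \alpha_2$ behave as ordinary numbers, we have

$$\frac{\partial}{\partial x_1} \alpha_x^p = p\, \alpha_x^{p-1} \alpha_1, \qquad \frac{\partial}{\partial x_2} \alpha_x^p = p\, \alpha_x^{p-1} \alpha_2. \qquad (23)$$

Multiply by $y_1, y_2$ respectively; add and then divide by $p$. Hence

$$\frac{1}{p}\left(y_1 \frac{\partial}{\partial x_1} + y_2 \frac{\partial}{\partial x_2}\right) \alpha_x^p = \alpha_x^{p-1} \alpha_y. \qquad (24)$$

This is the first polar of $\alpha_x^p$ with regard to $y$. Similarly the second polar is

$$\frac{1}{p(p-1)}\left(y_1 \frac{\partial}{\partial x_1} + y_2 \frac{\partial}{\partial x_2}\right)^2 \alpha_x^p = \alpha_x^{p-2} \alpha_y^2, \qquad (25)$$

and so on. The $r$th polar is

$$\alpha_x^{p-r} \alpha_y^r. \qquad (26)$$

The process ends in $p$ steps, for then all higher differentiation produces zero.

Exactly the same notation $\alpha_x^{p-r} \alpha_y^r$ denotes the $r$th polar of the $p$-ic $\alpha_x^p$ in $n$ homogeneous variables. This may at once be verified.

Further, we may have mixed polar forms involving anything up to $p$ different sets $[x], [y], [z], \ldots$.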

EXAMPLES 

1. For the quadratic $\alpha_x^2 = a_0 x_1^2 + 2a_1 x_1 x_2 + a_2 x_2^2$,

$$\alpha_x \alpha_y = a_0 x_1 y_1 + a_1 (x_1 y_2 + x_2 y_1) + a_2 x_2 y_2.$$

2. The cubic $\alpha_x^3 = (a_0, a_1, a_2, a_3 \between x_1, x_2)^3$ has two intermediate polars $\alpha_x^2 \alpha_y$, $\alpha_x \alpha_y^2$ and one mixed polar $\alpha_x \alpha_y \alpha_z$. Find their expressions in full.

3. The ternary cubic in canonical form is $x_1^3 + x_2^3 + x_3^3 + 6m x_1 x_2 x_3$. What are its polars of type $\alpha_x^2 \alpha_y$, $\alpha_x \alpha_y \alpha_z$?

Ans. $x_1^2 y_1 + x_2^2 y_2 + x_3^2 y_3 + 2m x_2 x_3 y_1 + 2m x_3 x_1 y_2 + 2m x_1 x_2 y_3$,

$x_1 y_1 z_1 + x_2 y_2 z_2 + x_3 y_3 z_3 + m(x_1 y_2 z_3 + x_1 y_3 z_2 + x_2 y_3 z_1 + x_2 y_1 z_3 + x_3 y_1 z_2 + x_3 y_2 z_1)$.

4. If $\alpha_x^2$ denote $(\alpha_1 x + \alpha_2 y + \alpha_3)^2$, in order to symbolize the conic $ax^2 + 2hxy + by^2 + 2gx + 2fy + c = 0$, what does $\alpha_x \alpha_{x'} = 0$ symbolize?

Ans. $(ax + hy + g)x' + (hx + by + f)y' + gx + fy + c = 0$.

5. Prove symbolically that if the polar of point $P$ for a conic passes through $Q$, that of $Q$ passes through $P$.

6. Prove symbolically that when the sets $[y], [z], \ldots$ are all the same as the original $[x]$, each polar reverts to the original form.

Ans. $\alpha_x \alpha_y \alpha_z \ldots = \alpha_x \alpha_x \alpha_x \ldots = \alpha_x^p$.

7. If $(\alpha_1 x + \alpha_2 y + \alpha_3)^p = 0$ symbolizes a plane curve of order $p$ in Cartesian co-ordinates, and $\alpha_x^{p-r} \alpha_{x'}^r = 0$ is defined as its $r$th polar for the point $(x', y')$, prove that the $r$th polar is a curve of order $p - r$ and only passes through $(x', y')$ if $(x', y')$ is on the original curve.

8. Equivalent Symbols.

One objection to these Clebsch-Aronhold symbols will probably have occurred to the reader: what is to be done when an expression is of degree two, or more, in the coefficients of a given ground form $f$ expressed symbolically as $\alpha_x^p$? For by definition a set of $n$ symbols $\alpha_1, \alpha_2, \ldots$ can only express an actual coefficient in products $p$ at a time. Hence a polynomial of degree $p$ in the symbols $\alpha$ is equivalent to a form linear in the actual coefficients of the ground form $f$. The simplest way of meeting this difficulty is first to consider simultaneous sets of coefficients, and functions which are linear in each set. For example, take two binary quadratics

$$\begin{aligned}
U &= a_0 x_1^2 + 2a_1 x_1 x_2 + a_2 x_2^2, \\
V &= b_0 x_1^2 + 2b_1 x_1 x_2 + b_2 x_2^2,
\end{aligned} \qquad (27)$$

written symbolically as $\alpha_x^2$ and $\beta_x^2$ respectively, where $\beta_x = (\beta_1 x_1 + \beta_2 x_2)$, and $\beta_1, \beta_2$ are symbols referring exclusively to the coefficients $b_0, b_1, b_2$; in particular

$$\beta_1^2 = b_0, \quad \beta_1 \beta_2 = b_1, \quad \beta_2^2 = b_2. \qquad (28)$$

Then manifestly any expression bilinear in the $a$'s and $b$'s can be expressed by a suitable combination of $\alpha$'s and $\beta$'s. Thus $a_0 b_2 = \alpha_1^2 \beta_2^2$, or, again,

$$a_0 b_2 + a_2 b_0 - 2a_1 b_1 = \alpha_1^2 \beta_2^2 + \alpha_2^2 \beta_1^2 - 2\alpha_1 \alpha_2 \beta_1 \beta_2 = (\alpha_1 \beta_2 - \alpha_2 \beta_1)^2 = (\alpha\beta)^2, \ \text{say}. \qquad (29)$$

Conversely $(\alpha\beta)^2$, being quadratic in both $\alpha_1, \alpha_2$ and $\beta_1, \beta_2$, can be expressed unambiguously in terms of the coefficients $a_0, a_1, a_2$ and $b_0, b_1, b_2$.

But suppose that the given quadratics $U, V$ are identical. Then

$$a_0 = b_0, \quad a_1 = b_1, \quad a_2 = b_2,$$

while

$$a_0 b_2 + a_2 b_0 - 2a_1 b_1 = 2(a_0 a_2 - a_1^2).$$

This last expression, $D_{11}$ say, which is the discriminant of the quadratic $U$, appears here as a particular case of a bilinear invariant $D_{12}$, where

$$D_{12} = a_0 b_2 + a_2 b_0 - 2a_1 b_1.$$

This gives us a clue for the symbolic expression of $D_{11}$: express $a_0 a_2 - a_1^2$ symbolically by using the Aronhold operator

$$\left(b \frac{\partial}{\partial a}\right) = b_0 \frac{\partial}{\partial a_0} + b_1 \frac{\partial}{\partial a_1} + b_2 \frac{\partial}{\partial a_2}$$

on $D_{11}$ to render it linear in both sets $[a]$ and $[b]$, after which we are at liberty to use symbols $\alpha$ and $\beta$. There are now two $\alpha$'s and two $\beta$'s in every term of the result and the letters $\alpha$ and $\beta$ are said to be equivalent symbols.

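We therefore write

$$\begin{aligned}
U &= \alpha_x^2 = \beta_x^2, \\
a_0 &= \alpha_1^2 = \beta_1^2, \\
a_1 &= \alpha_1 \alpha_2 = \beta_1 \beta_2, \\
a_2 &= \alpha_2^2 = \beta_2^2.
\end{aligned}$$

Similarly for the binary $p$-ic

$$f = \alpha_x^p = \beta_x^p = (a_0, a_1, a_2, \ldots, a_p \between x_1, x_2)^p.$$

Any product of degree two in the coefficients, say $a_r a_s$, is symbolized either as

$$\alpha_1^{p-r} \alpha_2^r\, \beta_1^{p-s} \beta_2^s$$

or as

$$\alpha_1^{p-s} \alpha_2^s\, \beta_1^{p-r} \beta_2^r,$$

which mean exactly the same actual product $a_r a_s$. Conversely any product of $p$ $\alpha$'s and $p$ $\beta$'s stands for a unique product of two coefficients $a_r, a_s$. The value of this result will appear in the sequel, for at present it seems artificial and useless.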
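The passage from a symbolic expression back to actual coefficients is entirely mechanical, as the following sketch suggests. It is not the author's procedure but an illustration under stated assumptions: a polynomial in the four binary symbols is stored as a dictionary from exponent tuples $(\alpha_1, \alpha_2, \beta_1, \beta_2)$ to numerical coefficients, and the helper names `multiply` and `evaluate` are hypothetical.

```python
def multiply(f, g):
    """Product of two polynomials in the symbols (alpha1, alpha2, beta1, beta2)."""
    out = {}
    for ef, cf in f.items():
        for eg, cg in g.items():
            e = tuple(i + j for i, j in zip(ef, eg))
            out[e] = out.get(e, 0) + cf * cg
    return out

def evaluate(sym, a, b):
    """Replace alpha1^(p-r) alpha2^r by a[r] and beta1^(p-s) beta2^s by b[s].
    Every term must contain exactly p alphas and p betas, else it is meaningless."""
    p = len(a) - 1
    total = 0
    for (a1, a2, b1, b2), c in sym.items():
        if a1 + a2 != p or b1 + b2 != p:
            raise ValueError("symbolic product has no actual meaning")
        total += c * a[a2] * b[b2]
    return total

bracket = {(1, 0, 0, 1): 1, (0, 1, 1, 0): -1}   # (alpha beta) = alpha1*beta2 - alpha2*beta1
square = multiply(bracket, bracket)             # the symbolic expression (alpha beta)^2

a, b = (1, 2, 5), (3, -1, 4)                    # hypothetical quadratics (a0, a1, a2), (b0, b1, b2)
```

Evaluating `square` with two distinct quadratics gives the bilinear invariant of (29); evaluating it with `b` replaced by `a` treats the symbols as equivalent and gives twice $a_0 a_2 - a_1^2$. The bracket alone, being of degree one in each symbol, is rejected as meaningless for quadratics.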

In general, to express a product

$$a_0^{\lambda} a_1^{\mu} \ldots a_p^{\rho}$$

of degree $i$ in the coefficients of the ground form we introduce $i$ equivalent symbols $\alpha, \beta, \gamma, \ldots$; or, what is the same thing, we render it linear in $i$ different coefficient sets $[a], [b], [c], \ldots$ by use of Aronhold operators $\left(b \dfrac{\partial}{\partial a}\right)$, $\left(c \dfrac{\partial}{\partial a}\right)$, ... and then substitute the symbols $\alpha, \beta, \gamma, \ldots$ as before.¹

For example, if $f = (a_0, a_1, a_2, a_3 \between x_1, x_2)^3$, a binary cubic, then

$$a_0 a_1 a_3 = \alpha_1^3 \cdot \beta_1^2 \beta_2 \cdot \gamma_2^3.$$

Conversely, a polynomial of degree three in each set $\alpha, \beta, \gamma$ symbolizes a polynomial of degree three in $a_0, a_1, a_2, a_3$.

EXAMPLES 


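1. Prove that, if $(\alpha\beta)$ denote $(\alpha_1 \beta_2 - \alpha_2 \beta_1)$, the Jacobian of two quadratics $\alpha_x^2$, $\beta_x^2$ is symbolized by $(\alpha\beta)\alpha_x \beta_x$.

2. If

$$\begin{vmatrix} a_0 & a_1 & a_2 \\ b_0 & b_1 & b_2 \\ c_0 & c_1 & c_2 \end{vmatrix}$$

is the determinant whose rows are the coefficients of three quadratics symbolized by $\alpha_x^2$, $\beta_x^2$, $\gamma_x^2$, prove that this determinant is symbolized by $-(\beta\gamma)(\gamma\alpha)(\alpha\beta)$.

3. If $\alpha$ and $\beta$ are equivalent symbols, prove that both $(\alpha\beta)\alpha_x \beta_x$ and $(\beta\gamma)(\gamma\alpha)(\alpha\beta)$ vanish identically.

4. The symbol $(\alpha\beta)\alpha_x^{p-1} \beta_x^{q-1}$ denotes the Jacobian of a binary $p$-ic $\alpha_x^p$ and a binary $q$-ic $\beta_x^q$, neglecting a numerical factor $pq$.

5. Show that for a cubic $(a_0, a_1, a_2, a_3 \between x_1, x_2)^3$, symbolized by $\alpha_x^3$ and $\beta_x^3$, the Hessian is given by $(\alpha\beta)^2 \alpha_x \beta_x$.

[The Hessian is

$$\begin{vmatrix} a_0 x_1 + a_1 x_2 & a_1 x_1 + a_2 x_2 \\ a_1 x_1 + a_2 x_2 & a_2 x_1 + a_3 x_2 \end{vmatrix}.$$

Use symbol $\alpha$ for row$_1$, and $\beta$ for row$_2$.]

6. Show that any determinant of order $n$ whose rows (or columns) each involve the coefficients $a_i$ of a quantic linearly can be symbolized by the use of $n$ equivalent symbols, one for each row (or column).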


¹ Various extensions of the use of these symbols, sometimes called umbral symbols, or umbrae, can be made, as, for example, in the next chapter. We pass over the question of general forms, whose order $p$ may not be an integer; cf. Encyklopädie der Math. Wiss., III, 3, 6 (1922), p. 7: and also that of the relation of symbols to a power series

$$a_0 + a_1 x + a_2 x^2 + a_3 x^3 + \ldots$$

whose coefficients are specific integers or combinations thereof. The reader will find a very interesting account of these by E. T. Bell, Algebraic Arithmetic (New York, 1927), 146-159.



CHAPTER XI 


The First Fundamental Theorem 

1. Symbolic Factors. Inner and Outer Products. 

We are now in a position to consider a theorem of fundamental importance in the invariant theory. It enables us to construct, with the help of the symbols, as many invariants and covariants of given ground forms as we like; and conversely the theorem proves that by adhering to a specified mode of construction, all invariants and covariants may be found.

As a preliminary let us consider the Jacobian of three ternary quadratic forms $U, V, W$ given by

$$\sum a_{ij} x_i x_j, \quad \sum b_{ij} x_i x_j, \quad \sum c_{ij} x_i x_j,$$

$i, j = 1, 2, 3$, whose symbolic forms are, say,

$$\alpha_x^2, \quad \beta_x^2, \quad \gamma_x^2,$$

where $\alpha_x = \alpha_1 x_1 + \alpha_2 x_2 + \alpha_3 x_3$, which may be regarded as an inner product of the set $\alpha_1, \alpha_2, \alpha_3$ and the set $x_1, x_2, x_3$. The symbolic form $\alpha_x^2$ is in fact the square of a symbolic linear form or inner product. All such factors $\alpha_x, \beta_x, \gamma_x$ are called symbolic factors of first type or symbolic inner products.

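Now the Jacobian of $U, V, W$ is

$$\begin{vmatrix}
\dfrac{\partial U}{\partial x_1} & \dfrac{\partial V}{\partial x_1} & \dfrac{\partial W}{\partial x_1} \\
\dfrac{\partial U}{\partial x_2} & \dfrac{\partial V}{\partial x_2} & \dfrac{\partial W}{\partial x_2} \\
\dfrac{\partial U}{\partial x_3} & \dfrac{\partial V}{\partial x_3} & \dfrac{\partial W}{\partial x_3}
\end{vmatrix}
= \begin{vmatrix}
2\alpha_x \alpha_1 & 2\beta_x \beta_1 & 2\gamma_x \gamma_1 \\
2\alpha_x \alpha_2 & 2\beta_x \beta_2 & 2\gamma_x \gamma_2 \\
2\alpha_x \alpha_3 & 2\beta_x \beta_3 & 2\gamma_x \gamma_3
\end{vmatrix},$$

since this determinant on expansion involves exactly two of each symbol $\alpha, \beta, \gamma$ in each term,

$$= 8 \begin{vmatrix} \alpha_1 & \beta_1 & \gamma_1 \\ \alpha_2 & \beta_2 & \gamma_2 \\ \alpha_3 & \beta_3 & \gamma_3 \end{vmatrix} \alpha_x \beta_x \gamma_x = 8(\alpha\beta\gamma)\,\alpha_x \beta_x \gamma_x,$$

which is the symbolic form of the Jacobian of three ternary quadratics. It involves a determinantal factor $(\alpha\beta\gamma)$ which is an outer product of the symbolic linear sets $\alpha, \beta, \gamma$ and sometimes is called a bracket factor or factor of the second type, to distinguish it from the numerical factor 8 and the symbolic inner product or linear factors $\alpha_x, \beta_x, \gamma_x$ which are of the first type.

Factors of these two kinds are characteristic of covariants (and invariants) expressed symbolically; indeed the fundamental theorem will demonstrate that every rational integral covariant of one or more ground forms involving variables $x_1, \ldots, x_n$ can be expressed symbolically entirely by means of these two kinds of symbolic factor.

Since an invariant contains no $x$, it is symbolized entirely by means of $\alpha, \beta, \gamma, \ldots$, and according to this theorem it cannot involve factors of type $\alpha_x$. So it is composed entirely of the second type, the determinantal factor involving $n$ symbols.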

2. Effect of Linear Transformation on the Symbols. 

First we must consider what happens to a general $p$-ic in $n$ variables, symbolized by

$$a_x^p = (a_1 x_1 + a_2 x_2 + \ldots + a_n x_n)^p, \qquad (1)$$

when a linear transformation $x \to x'$ is made. As in §6, p. 177, let the general term of this $p$-ic have coefficient

$$a_{ijk \ldots} = a_i a_j a_k \ldots. \qquad (2)$$

If the linear transformation is

$$x_i = \xi_i x_1' + \eta_i x_2' + \ldots + \omega_i x_n', \qquad i = 1, 2, \ldots, n, \qquad (3)$$

the effect on $a_x$ is

$$(a_1 \xi_1 + \ldots + a_n \xi_n) x_1' + (a_1 \eta_1 + \ldots + a_n \eta_n) x_2' + \ldots + (a_1 \omega_1 + \ldots + a_n \omega_n) x_n' \qquad (4)$$

or

$$a_\xi x_1' + a_\eta x_2' + \ldots + a_\omega x_n'.$$

Hence, as in (11), p. 149, the symbols $a$ behave as a set contragredient to $x$, a result of fundamental importance. Also the $p$-ic itself becomes

$$(a_1' x_1' + \ldots + a_n' x_n')^p = a_{x'}'^{\,p}, \qquad (5)$$

say, where the general term of the $p$-ic in $x_1', \ldots, x_n'$ has coefficient

$$a'_{ijk \ldots} = a_i' a_j' a_k' \ldots = a_{\xi'} a_{\eta'} a_{\zeta'} \ldots, \qquad (6)$$

$\xi', \eta', \zeta', \ldots$ denoting the $i$th, $j$th, $k$th ... of the set $\xi, \eta, \ldots, \omega$.

For example, the cubic $a_x^3$ becomes $a_{x'}'^{\,3}$ and the coefficient of $x_1'^2 x_2'$ is $a'_{112} = a_1' a_1' a_2' = a_\xi^2 a_\eta$.

It will be seen that the new coefficients are obtained from the old by exhaustively polarizing the $p$-ic $a_x^p$ with regard to $\xi, \eta, \ldots, \omega$ in all possible ways, in agreement with the previous result for binary forms. Similarly for other symbols $b, c, \ldots$, belonging to general forms $b_x^q, c_x^r, \ldots$.

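Next, the effect of the transformation on a symbolic determinant $(ab \ldots m)$ is to give

$$(a'b' \ldots m') = \begin{vmatrix} a_\xi & a_\eta & \cdots & a_\omega \\ \vdots & \vdots & & \vdots \\ m_\xi & m_\eta & \cdots & m_\omega \end{vmatrix} = (\xi\eta \ldots \omega)(ab \ldots m), \qquad (7)$$

another result of fundamental importance. Further, the effect on minors of this symbolic determinant is given by the theorem of corresponding matrices. Thus

$$\begin{aligned}
a_1' &= a_\xi, \quad a_2' = a_\eta, \quad \ldots, \quad a_n' = a_\omega, \\
(a'b')_{12} &= (ab \mid \xi\eta), \\
(a'b'c')_{123} &= (abc \mid \xi\eta\zeta), \ \text{&c.}
\end{aligned} \qquad (8)$$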
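Both results of fundamental importance, the invariance of the inner product and the behaviour (7) of the bracket factor, can be tried on numbers in the ternary case. The following is a minimal sketch, assuming the transformation $x_i = \xi_i x_1' + \eta_i x_2' + \zeta_i x_3'$ and treating the symbols as ordinary number triples (the scalar instance); the names `det3`, `dot`, `primed` and all the sample values are illustrative.

```python
def det3(m):
    """Determinant of a 3 x 3 array given as three rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def dot(u, v):
    """Inner product, e.g. a_x = a1*x1 + a2*x2 + a3*x3."""
    return sum(p * q for p, q in zip(u, v))

xi, eta, zeta = (2, 1, 0), (1, 3, 1), (0, 1, 4)   # hypothetical sets xi, eta, zeta

def primed(s):
    """Transformed symbols: a1' = a_xi, a2' = a_eta, a3' = a_zeta."""
    return (dot(s, xi), dot(s, eta), dot(s, zeta))

a, b, c = (1, 2, 3), (0, 1, 5), (2, -1, 1)        # three sample symbol sets
xp = (1, -2, 3)                                   # new variables x'
x = tuple(xi[i] * xp[0] + eta[i] * xp[1] + zeta[i] * xp[2] for i in range(3))

bracket_old = det3([a, b, c])                               # (abc)
bracket_new = det3([primed(a), primed(b), primed(c)])       # (a'b'c')
modulus = det3([xi, eta, zeta])                             # (xi eta zeta)
```

The inner product `dot(primed(a), xp)` reproduces `dot(a, x)` exactly, while the bracket acquires the single factor `modulus`.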

3. Converse Theorem. 

This at once gives us the converse of the First Fundamental Theorem in the following form:

Every symbolic product whose factors are solely of the two types $a_x$ or $(ab \ldots m)$ satisfies the invariant condition.

For let

$$P = (abc \ldots)(def \ldots) \ldots g_x h_x \ldots$$

be such a product, consisting of $w$ bracket factors and $\varpi$ $x$-factors, involving symbols $a, b, \ldots, h, \ldots$ of one or more given ground forms (1).

Let $P' = (a'b'c' \ldots)(d'e'f' \ldots) \ldots g'_{x'} h'_{x'} \ldots$ be the corresponding function of the accented symbols and variables.

Since $g'_{x'} = g_x$ and $(a'b'c' \ldots) = (\xi\eta\zeta \ldots)(abc \ldots)$, &c., we have

$$P' = (\xi\eta \ldots \omega)^w (abc \ldots)(def \ldots) \ldots g_x h_x \ldots = (\xi\eta \ldots \omega)^w P,$$

which proves the theorem.

As an example, consider the ternary symbolic product

$$P = (abc)(abd)\, c_x d_x,$$

so that

$$P' = (a'b'c')(a'b'd')\, c'_{x'} d'_{x'} = (\xi\eta\zeta)^2 (abc)(abd)\, c_x d_x.$$

Let this be expanded in full, as a polynomial in all the arguments $a_1', \ldots, d_3', x_1', x_2', x_3'$. On the left, a typical term is

$$a_1'^2\, b_2'^2\, c_3' d_3'\, c_1' d_1'\, x_1'^2,$$

which is of degree two in each set $a', b', c', d'$, and $x'$. Since each symbolic factor of either kind $(a'b'c')$ or $c'_{x'}$ is linear and homogeneous in its sets $a', b', c', x'$, it is at once apparent that the dimensions of the typical term are also given merely by counting how many of each symbol $a'$ or variable $x'$ occur in the unexpanded product $P'$: and this will be true in general.

Thus $P$ is a quadratic in $x$ because it has two factors $c_x, d_x$, and further, if $P$ is not a mere symbolic covariant but an actual covariant, we infer that the symbols $a, b, c, d$ refer to quadratic ground forms, because there are two of each symbol.

Hence $P$ is a covariant of weight two, the index of $(\xi\eta\zeta)$, and of degree four, in one quadratic, when

$$a_x^2 = b_x^2 = c_x^2 = d_x^2$$

are equivalent symbolic forms: or again is of partial degrees (1, 3) in two quadratics $a_x^2$ and $b_x^2 = c_x^2 = d_x^2$ where symbols $b, c, d$ but not $a$ refer to the second quadratic: or again is of partial degrees (2, 1, 1) in three quadratics: and so on.



i86 THE FIRST FUNDAMENTAL THEOREM [Chap 
4. The Valency Condition pq — nw -f vj iot Single Ground Form. 

If we continue to illustrate with the ternary case of the
$p$-ic
$$f = a_x^p = b_x^p = c_x^p = \ldots$$
where $a, b, c$ are equivalent symbols, we can see at once that
a product such as
$$P = (abc)(bcd)(cda)(dab) \ldots$$
involving $w$ bracket factors but no $x$-factors contains $3w$ symbols
in all, each distinct symbol, such as $a$, occurring $p$ times.
Hence for a ternary invariant involving $q$ different symbols of
a $p$-ic, the relation
$$pq = 3w$$
is essential. In general the condition
$$pq = nw$$
connects the degree $q$ and the weight $w$ of any invariant of a
$p$-ic in $n$ variables. For the subsequent proof of the fundamental
theorem will show that any invariant can be expressed as a sum
of symbolic products such as $P$.

Again, a more general product, say
$$Q = (abc)(bcd)(cda)\, a_x^2 b_x^2 c_x d_x^2,$$
involving $\varpi$ $x$-factors and $w$ bracket factors, is a covariant of
an $n$-ary $p$-ic if
$$pq = nw + \varpi,$$
as is seen by counting all the symbols $a, b, c, \ldots$ which occur.
Here $\varpi = 7$, $w = 3$, $n = 3$, $p = 4$, $q = 4$.

It is desirable to give a name to such relations between 
positive integers indicating degree, weight, or order of invariants 
and the like. Let them be called valency conditions. 
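
The counting behind a valency condition is purely mechanical, so a short Python sketch may help to make it concrete. The representation used here (each bracket factor as a string of symbol letters, the $x$-factors as a dictionary of exponents) and the function name `valency` are illustrative assumptions, not notation from the text.

```python
from collections import Counter


def valency(brackets, x_factors):
    """Count the symbols of a symbolic product.

    brackets  : list of strings, one per bracket factor, e.g. "abc"
    x_factors : dict mapping a symbol to the exponent of its x-factor
    Returns (n, w, varpi, q, p): number of variables, weight, number of
    x-factors, number of distinct symbols, and the common order p.
    Raises ValueError when the symbols do not all occur equally often.
    """
    n = len(brackets[0])
    if any(len(b) != n for b in brackets):
        raise ValueError("every bracket factor must hold n symbols")
    counts = Counter()
    for b in brackets:
        counts.update(b)              # one occurrence per letter in the bracket
    for symbol, exponent in x_factors.items():
        counts[symbol] += exponent    # occurrences in the x-factors
    orders = set(counts.values())
    if len(orders) != 1:
        raise ValueError("symbols occur unequally: not a form of one order p")
    p = orders.pop()
    w = len(brackets)
    varpi = sum(x_factors.values())
    q = len(counts)
    return n, w, varpi, q, p


# A covariant of a ternary quartic: (abc)(bcd)(cda) a_x^2 b_x^2 c_x d_x^2
n, w, varpi, q, p = valency(["abc", "bcd", "cda"],
                            {"a": 2, "b": 2, "c": 1, "d": 2})
print(n, w, varpi, q, p, p * q == n * w + varpi)
```

Since every symbol sits either in a bracket factor or in an $x$-factor, the total count is $nw + \varpi$ on one reckoning and $pq$ on the other, which is all the valency condition asserts.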


EXAMPLES 

1. A binary form of even order has no covariant of odd order. 

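2. A binary quartic has at least two invariants, $(\alpha\beta)^4$, $(\beta\gamma)^2(\gamma\alpha)^2(\alpha\beta)^2$.
Express these non-symbolically when
$$(a_0, a_1, a_2, a_3, a_4)(x_1, x_2)^4 = \alpha_x^4 = \beta_x^4 = \gamma_x^4.$$

3. A binary $n$-ic has an invariant of degree two only if $n$ is even.
For let $(a_0, a_1, \ldots, a_n)(x_1, x_2)^n = \alpha_x^n = \beta_x^n$. Then
$$(\alpha\beta)^n = a_0 a_n - n\, a_1 a_{n-1} + \binom{n}{2} a_2 a_{n-2} - \ldots + (-1)^n a_n a_0$$
$$= 0 \text{ if } n \text{ is odd.}$$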

5. First Fundamental Theorem for a System of Linear Forms. 

Let us first prove the theorem for linear forms in $n$ homogeneous
variables. Starting with a set of variables $x_1, x_2, \ldots, x_n$
and any number $q\ (\geq n)$ of linear forms, not necessarily all
distinct,
$$A = a_x = a_1x_1 + a_2x_2 + \ldots + a_nx_n$$
$$B = b_x = b_1x_1 + b_2x_2 + \ldots + b_nx_n$$
$$\ldots$$
$$K = k_x = k_1x_1 + k_2x_2 + \ldots + k_nx_n$$
we enunciate the theorem as follows.


Fundamental Theorem for Linear Forms.

Every rational integral projective invariant of linear forms
$a_x, b_x, c_x, \ldots$, is expressible as an aggregate of terms consisting
entirely of bracket factors of the type $(ab \ldots m)$ together with
numerical coefficients.

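This type of factor $(ab \ldots m)$ is indeed an $n$-rowed determinant
of the coefficient matrix
$$\begin{bmatrix} a_1 & a_2 & \ldots & a_n \\ b_1 & b_2 & \ldots & b_n \\ \vdots & \vdots & & \vdots \\ k_1 & k_2 & \ldots & k_n \end{bmatrix} \tag{10}$$

Proof. —

Let the result of a linear transformation $x \to x'$ on the
variables change the linear forms $A$ to $a'_{x'}$, $B$ to $b'_{x'}$, &c. Then
the new coefficient matrix is
$$\begin{bmatrix} a_1' & a_2' & \ldots & a_n' \\ b_1' & b_2' & \ldots & b_n' \\ \vdots & \vdots & & \vdots \\ k_1' & k_2' & \ldots & k_n' \end{bmatrix} = \begin{bmatrix} a_\xi & a_\eta & \ldots & a_\omega \\ b_\xi & b_\eta & \ldots & b_\omega \\ \vdots & \vdots & & \vdots \\ k_\xi & k_\eta & \ldots & k_\omega \end{bmatrix}$$
where
$$a_\xi = a_1\xi_1 + a_2\xi_2 + \ldots + a_n\xi_n, \text{ \&c.,} \tag{11}$$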



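and, as before, the coefficient matrix of the linear transformation is
$$M = \begin{bmatrix} \xi_1 & \eta_1 & \ldots & \omega_1 \\ \xi_2 & \eta_2 & \ldots & \omega_2 \\ \vdots & \vdots & & \vdots \\ \xi_n & \eta_n & \ldots & \omega_n \end{bmatrix} \tag{12}$$

Now if $I(a, b, \ldots, k)$ is an invariant rational and integral in
the $nq$ coefficients of these forms, then by hypothesis
$$I(a', b', \ldots, k') = |M|^w\, I(a, b, \ldots, k). \tag{13}$$

This is an identity in the $n^2$ elements $\xi_1, \ldots, \omega_n$, so that it still
remains an identity after differentiation by $\partial/\partial\xi_1$, or more generally
by the Cayley operator (§2, p. 113)
$$\Omega = \begin{vmatrix} \dfrac{\partial}{\partial\xi_1} & \dfrac{\partial}{\partial\xi_2} & \ldots & \dfrac{\partial}{\partial\xi_n} \\ \vdots & \vdots & & \vdots \\ \dfrac{\partial}{\partial\omega_1} & \dfrac{\partial}{\partial\omega_2} & \ldots & \dfrac{\partial}{\partial\omega_n} \end{vmatrix} \tag{14}$$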


Regarded as a function of $\xi_1, \ldots, \omega_n$ the right member of (13)
is a polynomial, homogeneous and of degree $w$ in the set
$\xi_1, \ldots, \xi_n$, as well as in each of the other sets $\eta$, $\ldots$, or $\omega$.
For such sets only enter by way of the determinant $|M|$ which
is linear in each set. To balance this the left member of (13)
must also be homogeneous and of degree $w$ in each set. But
since $I(a', b', \ldots, k')$ is explicitly a polynomial in $a_1', \ldots, k_n'$,
which are the same as $a_\xi, \ldots, k_\omega$, it follows that every term on
the left of (13) contains exactly $w$ factors $a_\xi, b_\xi, \ldots$ involving $\xi$,
$w$ factors $a_\eta, b_\eta, \ldots$ involving $\eta$, and so on.

We now operate on both sides of (13) with $\Omega$. Since ((33),
p. 122),
$$\Omega\, a_\xi b_\eta c_\zeta \ldots m_\omega r_\xi \ldots = \Sigma\, (abc \ldots m) \ldots, \tag{15}$$
where in $\Sigma$ the letters $a, b, \ldots, m, r, \ldots$ are suitably permuted,
the precise way being immaterial, we obtain for every term on
the left an aggregate of terms each containing a bracket factor
like $(abc \ldots m)$ but one $\xi$ fewer, one $\eta$ fewer, &c. On the right
we obtain (§3, p. 114)
$$w(w + 1) \ldots (w + n - 1)\, |M|^{w-1}\, I(a, b, \ldots, k). \tag{16}$$
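
The numerical factor on the right can be checked directly in the smallest case. The following Python sketch, offered only as an illustration, takes $n = 2$, where $\Omega = \partial^2/\partial\xi_1\partial\eta_2 - \partial^2/\partial\xi_2\partial\eta_1$ and $|M| = \xi_1\eta_2 - \xi_2\eta_1$, and verifies that $\Omega\,|M|^w = w(w+1)\,|M|^{w-1}$. The dictionary representation of polynomials and all function names are assumptions made for this example.

```python
from collections import defaultdict

# A polynomial in (xi1, xi2, eta1, eta2) is a dict {exponent tuple: coefficient}.
XI1, XI2, ETA1, ETA2 = 0, 1, 2, 3
DET = {(1, 0, 0, 1): 1, (0, 1, 1, 0): -1}   # |M| = xi1*eta2 - xi2*eta1


def mul(p, q):
    """Product of two polynomials."""
    out = defaultdict(int)
    for ea, ca in p.items():
        for eb, cb in q.items():
            out[tuple(x + y for x, y in zip(ea, eb))] += ca * cb
    return {e: c for e, c in out.items() if c != 0}


def power(p, k):
    """k-th power of a polynomial (k >= 0)."""
    out = {(0, 0, 0, 0): 1}
    for _ in range(k):
        out = mul(out, p)
    return out


def diff(p, i):
    """Partial derivative with respect to the i-th variable."""
    out = defaultdict(int)
    for e, c in p.items():
        if e[i] > 0:
            f = list(e)
            f[i] -= 1
            out[tuple(f)] += c * e[i]
    return {e: c for e, c in out.items() if c != 0}


def sub(p, q):
    """Difference p - q."""
    out = defaultdict(int, p)
    for e, c in q.items():
        out[e] -= c
    return {e: c for e, c in out.items() if c != 0}


def scale(p, k):
    """Numerical multiple k*p."""
    return {e: c * k for e, c in p.items() if c * k != 0}


def omega(p):
    """Cayley operator for n = 2."""
    return sub(diff(diff(p, XI1), ETA2), diff(diff(p, XI2), ETA1))


for w in range(1, 5):
    lhs = omega(power(DET, w))
    rhs = scale(power(DET, w - 1), w * (w + 1))
    print(w, lhs == rhs)
```

The same bookkeeping extends to larger $n$, at the cost of a longer determinant for $\Omega$.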





Repeating this process until $\Omega$ operates $w$ times on the original
identity, we exhaust all of $\xi, \eta, \ldots, \omega$ on either side, gaining
at every stage one new bracket factor in each term on the left,
and thereby accounting for all the letters $a, b, c, \ldots$ on the left.
Thus
$$\Sigma\, \lambda\, (abc \ldots m)(rs \ldots) \ldots = \mu\, I(a, b, c, \ldots), \tag{17}$$
where $\lambda$, $\mu$ are numerical constants, and $\mu \neq 0$ (Ex. 6, p. 115).
Dividing by $\mu$ we finally express $I(a, b, c, \ldots)$ in the desired
form.

Corollary I. — The number $q$ of linear forms, including
repetitions, in an invariant is a multiple of $n$. In fact $q = nw$.

Corollary II. — The simplest invariant of linear forms
$a_x, b_x, \ldots$ is $(ab \ldots m)$ involving $n$ different forms; it is the
determinant of the coefficient matrix of $n$ such forms.

For it is the simplest expression of the requisite type, and
it would vanish if two of the sets $a$, $b$ in the determinant were
identical.
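
The invariance of this simplest bracket factor is easy to test numerically. The sketch below is an illustration only: the helper names and the sample integers are assumptions. It forms the new coefficients $a_\xi, a_\eta, \ldots$ from the old ones and confirms that the determinant of the new coefficient matrix is $|M|$ times the old one, that is, an invariant of weight one.

```python
from itertools import permutations


def sign(perm):
    """Sign of a permutation given as a tuple, by counting inversions."""
    s = 1
    for i in range(len(perm)):
        for j in range(i + 1, len(perm)):
            if perm[i] > perm[j]:
                s = -s
    return s


def det(m):
    """Determinant by the Leibniz expansion (adequate for small n)."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        term = sign(perm)
        for i in range(n):
            term *= m[i][perm[i]]
        total += term
    return total


def transform(coeffs, M):
    """New coefficient rows: a'_j = a_1*M[0][j] + ... + a_n*M[n-1][j].

    Column j of M holds xi, eta, ..., so a'_1 = a_xi, a'_2 = a_eta, &c.
    """
    n = len(M)
    return [[sum(row[i] * M[i][j] for i in range(n)) for j in range(n)]
            for row in coeffs]


forms = [[1, 2, 0], [0, 1, 3], [2, -1, 1]]   # rows a, b, c of three ternary linear forms
M = [[2, 1, 0], [1, 1, 1], [0, 3, 1]]        # columns xi, eta, zeta
print(det(transform(forms, M)) == det(M) * det(forms))
```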

Corollary III. — No invariant exists of less than n linear 
forms. 

Corollary IV. — Each term in the summation on the left of 
(17) is an invariant. This follows from §3, p. 184. 

The above proof contains the leading idea required in the 
general proof for invariants of any ground forms. It has the 
great merit of carrying with it the actual method for throwing 
a given invariant into this convenient symbolic form. And 
although the process would be tedious in complicated cases, the 
reader will find it very instructive to follow out the steps in 
detail for a few simple cases, in order to grasp the several prin- 
ciples of the proof. 

6. Invariants of One or More General Ground Forms.

Next we prove the Fundamental Theorem for the case of
invariants of forms of higher order than the linear. But for the
initial step, the proof is exactly the same. Thus let there now
be $q$ given ground forms of orders $r, s, \ldots, t$ respectively,
written symbolically as
$$A = a_x^r = (a_1x_1 + a_2x_2 + \ldots + a_nx_n)^r$$
$$B = b_x^s = (b_1x_1 + b_2x_2 + \ldots + b_nx_n)^s$$
$$\ldots$$
$$K = k_x^t = (k_1x_1 + k_2x_2 + \ldots + k_nx_n)^t$$
where as before these $q$ forms need not necessarily all be distinct.
We utilize, in all, $qn$ different symbols $a_1, a_2, \ldots, k_n$: a product
of $r$ $a$'s gives an actual coefficient of the first ground form $A$;
a product of $s$ $b$'s, one belonging to the next form $B$; and so on.

In this way we express the first ground form as a perfect $r$th
power of a symbolic linear form; and similarly for the rest.
Then if the linear transformation be made as before,
$a_1x_1 + \ldots + a_nx_n$ becomes $a_\xi x_1' + a_\eta x_2' + \ldots$. Accordingly
the form $A$ of order $r$ becomes
$$(a_\xi x_1' + a_\eta x_2' + \ldots)^r = (a_1'x_1' + a_2'x_2' + \ldots)^r = {a'}_{x'}^{\,r},$$
where $a'$ is the corresponding symbol associated with the new
variable $x'$. Similarly for the symbols $b, \ldots, k$.

Now if the invariant $I(a, b, \ldots, k)$ is a polynomial in the
actual coefficients of forms $A, B, \ldots, K$, it is also a polynomial
in the $qn$ symbols $a_1, \ldots, k_n$ when actual coefficients are resolved
into symbols as in §5 (17). It can then be treated exactly as in
the previous case; and the result of the operation $\Omega^w$ upon the
identity
$$I(a', b', \ldots, k') = |M|^w\, I(a, b, \ldots, k)$$
expresses the invariant entirely in terms of the bracket factors
$(ab \ldots m)$.

Since this bracket factor on expansion is linear in the set
$a_1, \ldots, a_n$, and therefore vanishes if $a$ appears twice in the
same factor, we infer that exactly $r$ such factors each contain
a symbol $a$; $s$ factors contain $b$; and so on.

Further there are exactly $w$ such factors, one for each step
in the operation $\Omega^w$ (cf. Ex. 1, 4, p. 123). Hence the total
number of symbols $a, b, \ldots, k$ in one such product of factors is
$nw$, so that the following valency relation must hold:
$$r + s + \ldots + t = nw.$$






Corollary I. — The degree q and the weight w of an invariant 
of a single ground form of order p in n variables satisfy the 
condition pq = nw. 

For in this case $r = s = \ldots = t = p$, and the $q$ forms
$A, B, \ldots$ are all the same. Symbolically we denote such a form
$$f = a_x^p = b_x^p = \ldots = k_x^p.$$

7. Examples of Invariants. Interchange of Equivalent Symbols.

Various examples have already been given of the symbolic 
form of invariants. We must now consider more particularly a 
few cases where a single ground form is involved and q equivalent 
symbols are needed for expressing an invariant of degree q in its 
coefficients. 

Let the ground form be
$$f = a_x^p = b_x^p = c_x^p = \text{\&c.}$$

Example 1. —

If $f$ is a binary quadratic,
$$f = a_x^2 = a_{11}x_1^2 + 2a_{12}x_1x_2 + a_{22}x_2^2,$$
then its discriminant is $D = a_{11}a_{22} - a_{12}^2$, which is symbolized
as $a_1^2b_2^2 - a_1a_2b_1b_2$, or equally well as $b_1^2a_2^2 - b_1b_2a_1a_2$, which
is formed from the first expression by interchanging the roles of
$a$ and $b$ throughout. This process is called the interchange of
equivalent symbols, and for two symbols introduces two alternative
forms of the expression, for three symbols, $3!$ alternatives,
and for $n$ symbols, $n!$ alternatives, when interchanges are made
in all possible ways. Taking the first symbolic form, we have
$$D = a_1b_2(a_1b_2 - a_2b_1) = a_1b_2(ab).$$
Interchanging equivalent symbols, we also have
$$D = b_1a_2(ba).$$
But $(ba) = -(ab)$. Hence by addition
$$2D = a_1b_2(ab) - b_1a_2(ab) = (ab)^2.$$



This little device which is of great use in throwing invariants
into convenient form deserves close study. Let us now adopt
a previous notation, in order to abbreviate such a process. So
in this example we write
$$D = a_1b_2(ab), \qquad 2D = \dot{a}_1\dot{b}_2(ab) = (ab)(ab) = (ab)^2.$$
Example 2. — 

The discriminant of a ternary quadratic $f = a_x^2 = \Sigma\, a_{ij}x_ix_j$
is an invariant of degree three in the coefficients
$$D = |a_{ij}| = \begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{vmatrix}, \qquad (a_{ij} = a_{ji}).$$
Here $a_{ij} = a_ia_j = b_ib_j = c_ic_j$ and three equivalent symbols are
needed to represent the expansion of the three-rowed determinant
$D$. Since $D$ is linear in its column elements we may
write (Ex. 6, p. 181)
$$D = \begin{vmatrix} a_1^2 & b_1b_2 & c_1c_3 \\ a_2a_1 & b_2^2 & c_2c_3 \\ a_3a_1 & b_3b_2 & c_3^2 \end{vmatrix} = a_1b_2c_3 \begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}$$
after extracting common factors from columns. Accordingly
$$D = a_1b_2c_3(abc).$$

Now we interchange the equivalent symbols $a, b, c$ in all six possible
ways, and add the results. Since $(abc) = -(acb) = (bca)$, &c.,
this gives
$$6D = (a_1b_2c_3 - a_1b_3c_2 + a_2b_3c_1 - a_2b_1c_3 + a_3b_1c_2 - a_3b_2c_1)(abc)$$
$$= \dot{a}_1\dot{b}_2\dot{c}_3(abc) = (abc)^2.$$
So
$$D = \tfrac{1}{6}(abc)^2;$$
and by the fundamental theorem this is an invariant.

In general the discriminant of a quadratic form in $n$ variables
is
$$D = |a_{ij}| = \frac{1}{n!}(abc \ldots m)^2.$$
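
The relation $(abc \ldots m)^2 = n!\,|a_{ij}|$ can be confirmed by expanding both brackets and replacing each symbolic product $a_ia_j$ by the actual coefficient $a_{ij}$. The Python sketch below does this by brute force for small $n$. It is a check under stated assumptions rather than the book's method; the names `bracket_square`, `det` and `sign` are invented for the illustration.

```python
from itertools import permutations
from math import factorial


def sign(perm):
    """Sign of a permutation given as a tuple, by counting inversions."""
    s = 1
    for i in range(len(perm)):
        for j in range(i + 1, len(perm)):
            if perm[i] > perm[j]:
                s = -s
    return s


def det(m):
    """Determinant by the Leibniz expansion."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        term = sign(perm)
        for i in range(n):
            term *= m[i][perm[i]]
        total += term
    return total


def bracket_square(A):
    """Actual value of the symbolic invariant (ab...m)^2 of a quadratic form.

    Each bracket expands as a signed sum over permutations. In a pair
    (s, t) of terms the k-th symbol carries suffixes s[k] and t[k], and
    that symbolic product stands for the actual coefficient A[s[k]][t[k]].
    """
    n = len(A)
    perms = [(p, sign(p)) for p in permutations(range(n))]
    total = 0
    for s, es in perms:
        for t, et in perms:
            term = es * et
            for k in range(n):
                term *= A[s[k]][t[k]]
            total += term
    return total


binary = [[2, 1], [1, 3]]
ternary = [[2, 1, 0], [1, 3, 1], [0, 1, 4]]
for A in (binary, ternary):
    print(bracket_square(A) == factorial(len(A)) * det(A))
```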



Example 3. — 

The ternary cubic $f = a_x^3 = \Sigma\, a_{ijk}x_ix_jx_k$ has two invariants
which can be symbolized by
$$S = (abc)(abd)(acd)(bcd)$$
$$T = (abc)(abd)(ace)(bcf)(def)^2.$$
Here $a, b, c, d, e, f$ are equivalent symbols whose actual coefficient
sets are all equal:
$$a_{ijk} = a_ia_ja_k = b_ib_jb_k = \ldots \qquad i, j, k = 1, 2, 3.$$
Merely counting the number of $a$'s, &c., verifies the invariant
property of $S$ and $T$; counting bracket factors gives the weight
$w$, and the number of different letters $a, b$, &c., gives the degree $q$.
Thus
$$S: \quad n = 3, \quad p = 3, \quad q = 4, \quad w = 4,$$
$$T: \quad n = 3, \quad p = 3, \quad q = 6, \quad w = 6,$$
which naturally satisfy the valency condition $pq = nw$.

Symbolic invariants may vanish identically: thus the simpler
looking invariant $(abc)^3$ is zero. For
$$(abc)^3 = (bac)^3$$
on interchanging equivalent symbols. But
$$(bac)^3 = [-(abc)]^3 = -(abc)^3; \text{ hence } (abc)^3 + (abc)^3 = 0.$$

Example 4. —

$(abc \ldots m)^p$ is the invariant of lowest weight and degree for
the general form of order $p$, but it vanishes identically if $p$ is
odd.
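
Both halves of this statement can be tested by the same brute-force expansion of the symbolic power. In the Python sketch below the function `bracket_power`, the way coefficients are supplied (a callable on index tuples) and the sample coefficients are all assumptions made for illustration. It evaluates $(ab)^4$ for a binary quartic, where the known value is $2(a_0a_4 - 4a_1a_3 + 3a_2^2)$, and shows that the cube of the bracket vanishes for a cubic.

```python
from itertools import permutations, product


def sign(perm):
    """Sign of a permutation given as a tuple, by counting inversions."""
    s = 1
    for i in range(len(perm)):
        for j in range(i + 1, len(perm)):
            if perm[i] > perm[j]:
                s = -s
    return s


def bracket_power(coeff, n, p):
    """Actual value of the symbolic power (ab...m)^p for a form of order p.

    coeff(indices) must return the actual coefficient belonging to a
    p-tuple of suffixes (0-based). One term is taken from each of the p
    bracket factors; the k-th symbol then carries p suffixes, one per
    factor, and that symbolic product is replaced by coeff(...).
    """
    perms = [(s, sign(s)) for s in permutations(range(n))]
    total = 0
    for choice in product(perms, repeat=p):
        term = 1
        for _, e in choice:
            term *= e
        for k in range(n):
            term *= coeff(tuple(s[k] for s, _ in choice))
        total += term
    return total


# Binary quartic (a0, a1, a2, a3, a4)(x1, x2)^4: the coefficient depends
# only on how many suffixes equal 2 (index 1 in 0-based counting).
a = [1, 0, 2, 1, 3]
quartic = lambda idx: a[sum(idx)]
print(bracket_power(quartic, 2, 4),
      2 * (a[0] * a[4] - 4 * a[1] * a[3] + 3 * a[2] ** 2))

# An arbitrary symmetric ternary cubic: the value depends on the sorted suffixes.
cubic = lambda idx: sum((i + 1) ** 2 for i in sorted(idx)) + len(set(idx))
print(bracket_power(cubic, 3, 3))
```

The vanishing for odd $p$ appears here as an exact cancellation of the signed terms in pairs, which is the interchange argument of Example 3 carried out term by term.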

8. Double Convolution of Symbols referring to a Quadric Form.¹

The following theorem follows up the process used in the 
above Example 2. It gives in general what was originally dis- 
covered by Gordan in his successful researches upon quadratics 
in ternary and quaternary forms. The technique has already 
been explained in § 9 , p. 44 . 

1 This section 8 may be omitted on a first reading. 



Theorem. — A symbolic product $P$ of bracket factors, wherein $h$
equivalent symbols of a quadric are explicitly convolved once, is
expressible as a sum of terms in which these symbols are explicitly
convolved twice.

Proof . — 

For clearness we first consider a simple example, $n = 5$,
$h = 3$. Let
$$P = (abcrs)(abM)(cN)Q,$$
where $a, b, c$ are equivalent symbols, convolved once. Since
they belong to a quadric, they have duplicates, which occur
elsewhere in factors of $P$, either convolved in one factor or not.
In this example they occur in two factors, and the remaining
symbols $r, s, M, N, Q$ represent suitable arbitrary elements,
involving neither $a$, nor $b$, nor $c$. This last condition is essential
to the success of the proof.

Interchanging $a, b, c$ in all $3!$ ways we have $3!$ alternative
expressions for $P$. We add them together and obtain
$$3!\, P = (abcrs)(abM)(cN)Q + (bacrs)(baM)(cN)Q + \ldots$$
But the first factor of each term is of type $\pm(abcrs)$. Hence
$$3!\, P = (abcrs)\{(abM)(cN)Q - (baM)(cN)Q + \ldots\}$$
$$= 2(abcrs)\{(abM)(cN)Q + (bcM)(aN)Q + (caM)(bN)Q\}$$
$$= 2(abcrs)(\dot{a}\dot{b}M)(\dot{c}N)Q$$
$$= 2(abcrs)(abc\,\dot{m}'\dot{m}'')(\dot{m}N)Q,$$
where $M = mm'm''$, by a fundamental identity I, §9, p. 45.
This proves the theorem for P. Here every step has been given 
because each is typical of the general case. Thus we take 

$$P = (A_iB_jK)(A_iM)(B_jN)Q,$$
where $A_i$ denotes $i$ symbols $a_1, \ldots, a_i$, and $B_j$ denotes $b_1, \ldots, b_j$,
all $i + j$ of which are equivalent and refer to one quadric. Then
by all possible interchanges
$$(i + j)!\, P = (A_iB_jK)\{\Sigma \pm (A_iM)(B_jN)Q\}$$
$$= i!\, j!\, (A_iB_jK)(\dot{A}_iM)(\dot{B}_jN)Q.$$





If now the n — i columns M are written we have 

by a fundamental identity 

{i + j)\ P=i\j\ 

with $A_iB_j$ convolved twice. Similarly if the first factor of $P$ has
$A_iB_jC_k \ldots$ all referring to the one quadric, we obtain by the
same process

(i + j + k+...)\ P^i\j\ 

This proves the theorem. 

Corollary. — The quadric may be replaced by a form of order
$2p$ in which $P$ contains equivalent symbols convolved $2p - 1$ times.
The process produces a further convolution.

Again, the new convolution may be assembled in any assigned
bracket factor $g$ except the first. The process deranges the
other original symbols of the factor $g$, but leaves them
implicitly convolved.

For instance, in the first example
$$g = (abM) = (abmm'm''),$$
and after convolving $abc$ in $g$, the symbols $M$, expressed in the
currency of $a$ as $m, m', m''$, are implicitly convolved in a series
of three terms $mm'$, $m''$. But the factor $g$ selected might equally
well have been $(cN)$ or even a factor of $Q$, since the fundamental
identities apply in each case.

9. Solution of Symbolic Linear Equations. 

Equations frequently arise which are linear in one set x of 
variables. Even when expressed symbolically they can be solved 
by the ordinary methods. Thus for a quaternary case consider 

$$a_xa_y = 0, \quad b_xb_z = 0, \quad c_xc_t = 0, \tag{19}$$
which arise from three quadrics polarized with regard to $y$, $z$, $t$
respectively (§7, p. 177).

Writing each in full as far as $x$ is concerned we have
$a_xa_y = a_y(a_1x_1 + a_2x_2 + a_3x_3 + a_4x_4)$. Then after
multiplying the three by $b_zc_t(bc)_{23}$, $c_ta_y(ca)_{23}$, $a_yb_z(ab)_{23}$ respectively
and adding we obtain
$$a_yb_zc_t\{(abc)_{123}\,x_1 + (abc)_{423}\,x_4\} = 0. \tag{20}$$
Solving this and similar results we find
$$x_1 : x_2 : x_3 : x_4 = a_yb_zc_t(abc)_{234} : a_yb_zc_t(abc)_{314} : a_yb_zc_t(abc)_{124} : -a_yb_zc_t(abc)_{123}. \tag{21}$$
Now if we introduce arbitrary values $u_1, u_2, u_3, u_4$ such
that $u_x = \Sigma\, u_ix_i = 0$, then
$$(abcu)\,a_yb_zc_t = 0. \tag{22}$$
It will be seen that the above equations can be solved exactly
as linear equations $a_x = 0$, $b_x = 0$, $c_x = 0$, provided that we
maintain, throughout, the original common factors $a_y$, $b_z$, $c_t$ which
give actuality to the equations.

Further, if $a, b, c$ are equivalent symbols we can follow the
methods of p. 192. Thus (21) and (22) become
$$x_1 : x_2 : x_3 : x_4 = (abc \mid yzt)(abc)_{234} : \text{\&c.} \tag{23}$$
and
$$(abcu)\,a_yb_zc_t = \tfrac{1}{6}(abcu)(abc \mid yzt). \tag{24}$$

CHAPTER XII 


Multilinear Forms 

1. Multilinear Forms. 

We now set about proving the Fundamental Theorem in a 
more general form, so as to include covariants (§12, p. 144) as 
well as invariants within its scope. To do this we must examine 
more in detail other possible types of ground form than that 
which depends only on one set of variables $x_1, x_2, \ldots, x_n$. One
remarkable feature of the symbolic method is that the more 
general cases are as easy to handle as the special cases, as we 
saw in considering the invariant, multilinear in q ground forms 
of order p, which is symbolically of the same structure as an 
invariant of degree q in one ground form. 

Let us consider the ground form $a_x^p$, of order $p$ in $n$ homogeneous
variables $x$, from this point of view. We take
$$f = a_x^p = \Sigma\, a_{ijk\ldots}\, x_ix_jx_k \ldots, \tag{1}$$
where each of $i, j, k, \ldots$ has values $1, 2, \ldots, n$. If we polarize
this with regard to $p - 1$ sets $y, z, \ldots$ in succession we render
it multilinear in $p$ sets $x, y, z, \ldots$. For brevity of statement
we consider the ternary cubic ($n = p = 3$). Thus
$$a_x^3 = (a_1x_1 + a_2x_2 + a_3x_3)^3 = \Sigma\, a_{ijk}\, x_ix_jx_k, \tag{2}$$
where
$$a_ia_ja_k = a_{ijk} = a_{jik} = \text{\&c.} \tag{3}$$
Operating with
$$\tfrac{1}{3}\,\Sigma\, y_i\frac{\partial}{\partial x_i} \quad \text{followed by} \quad \tfrac{1}{2}\,\Sigma\, z_i\frac{\partial}{\partial x_i},$$
where sets $y$ and $z$ are independent of $x$, we obtain
$$a_xa_ya_z = \Sigma\, a_{ijk}\, x_iy_jz_k.$$
Such is called a trilinear ternary form: it is multilinear in
three sets of variables. A simpler example of multilinearity is
the bilinear form in two sets $x$, $y$, say
$$a_xa_y = \Sigma\, a_{ij}\, x_iy_j,$$
arising as first polar of the quadratic $a_x^2 = \Sigma\, a_{ij}x_ix_j$.
Owing to the symmetrical suffix relations (3), such multilinear
forms are not the most general; howbeit they serve as an introduction
to the general form.

Definition of General Multilinear Form in Sets of n Variables.
— The form $\Sigma\, a_{ijk\ldots}\, x_iy_jz_k \ldots$ is the general multilinear form in
$p$ sets of $n$ variables, if every derangement of the suffixes $ijk \ldots$ of
the typical coefficient alters its value.

For instance, the general ternary bilinear form is
$$\Sigma\, a_{ij}x_iy_j = \begin{cases} \phantom{+}\, a_{11}x_1y_1 + a_{12}x_1y_2 + a_{13}x_1y_3 \\ +\, a_{21}x_2y_1 + a_{22}x_2y_2 + a_{23}x_2y_3 \\ +\, a_{31}x_3y_1 + a_{32}x_3y_2 + a_{33}x_3y_3, \end{cases}$$
where $a_{ij} \neq a_{ji}$. Such a form cannot be derived by polarizing
a quadratic $\Sigma\, a_{ij}x_ix_j$, since the coefficients of $x_1y_2$ and $x_2y_1$ differ.

2. Symbolic Representation of Multilinear Forms. 

Following the lines suggested in the case of the binary form
we readily find a suitable symbol for the general form in $p$ sets
$x, y, z, \ldots$.

Operating on $f$ with $\dfrac{\partial^p}{\partial x_i\,\partial y_j\,\partial z_k \ldots}$ we obtain the single coefficient
$a_{ijk\ldots}$. Then if we write
$$a_i = \frac{\partial}{\partial x_i}, \quad b_j = \frac{\partial}{\partial y_j}, \quad c_k = \frac{\partial}{\partial z_k}, \ \ldots \qquad (i, j, k = 1, 2, \ldots, n),$$
as in (17), p. 173, the $p$ symbols $a_i, b_j, c_k, \ldots$ commute with one
another, because of the fundamental law of partial differentiation.
Also they have an actual significance if they occur in a product
involving exactly $p$ of them: one $a$, one $b$, and so on.

Hence
$$a_ib_jc_k \ldots = a_{ijk\ldots} = b_ja_ic_k \ldots = \text{\&c.};$$
so that
$$f = \Sigma\, a_{ijk\ldots}\, x_iy_jz_k \ldots$$
$$= (a_1x_1 + \ldots + a_nx_n)(b_1y_1 + \ldots + b_ny_n) \times \ldots$$
$$= a_xb_yc_z \ldots,$$

which symbolizes the general multilinear form. It involves $p$
sets of variables ($np$ variables in all) and $p$ sets of symbols $a, b, c, \ldots$
($np$ symbols in all). In particular if the form is symmetrical in
two sets of variables $x$ and $y$, we may take the corresponding
symbols as equal, now writing
$$a_{ijk\ldots} = a_ia_jc_k \ldots, \qquad f = a_xa_yc_z \ldots,$$
for in this case there is no reason to distinguish between $a_{ijk\ldots}$
and $a_{jik\ldots}$, that is to say, between $a_ib_j$ and $a_jb_i$.

By making some of the sets $x, y, z$ equal we include as a special
case of the multilinear form the form of higher order in fewer
sets of $n$ variables. For instance
$$a_x^2b_z^2, \qquad a_xa_yb_zb_t, \qquad a_xb_yc_zd_t$$
are respectively $(2, 2)$, $(1, 1, 1, 1)$, $(1, 1, 1, 1)$ forms. The $(2, 2)$
form is a special case of the next form when $x = y$, $z = t$. This
in turn is a special case of the third form, for it is symmetrical
in $x$ and $y$ and in $z$ and $t$. The last is the general quadrilinear
form.

3. Classification of Multilinear Forms. 

From the point of view of the invariant theory, the natural 
way to analyse these forms is according to the behaviour of the 
sets of variables x, y, . . . . To avoid constant repetition let us 
understand by the simple phrase “the variables $x, y, \ldots$”
the sets of variables $(x_1, x_2, \ldots, x_n)$, $(y_1, \ldots, y_n)$, $\ldots$. Then
supposing all variables to undergo linear transformations, say
$x \to x'$, $y \to y'$, $\ldots$, everything turns on whether these
transformations are independent or not. For our present purpose
we adopt the following classification in ascending order of 
complexity. 

I. Variables all cogredient.

II. Variables cogredient and contragredient. 

III. Variables independent. 



If $x, y, z, \ldots$ are cogredient and $u, v, w, \ldots$ are all contragredient
to $x, y, z, \ldots$, then I is really a special case of II, when one
type of variable, say $u, v, w$, is entirely absent. The more general
case III is at present dismissed, so that our chief concern is
with II.

Remembering that $x$ is a column matrix, we write in matrix
notation ((15), p. 149)
$$x = Mx', \quad y = My', \quad z = Mz', \ \text{\&c.,}$$
for the cogredient transformations of coefficient matrix $M$, and
$$u = M'^{-1}u', \quad v = M'^{-1}v', \ \text{\&c.,}$$
for the induced transformations of the contragredient sets
$u, v, w, \ldots$.
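
The defining property of contragredient sets, that the inner product $u_x$ is unaltered, can be illustrated with a few lines of Python. The sketch avoids inverting $M$ by starting from $x'$ and $u$ and using the equivalent relations $x = Mx'$ and $u' = M'u$, with $M'$ the transpose; the helper names and sample values are assumptions made for the example.

```python
def mat_vec(M, v):
    """Product of a square matrix with a column vector."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def transpose(M):
    """Transposed matrix M'."""
    return [list(col) for col in zip(*M)]


def inner(u, x):
    """Inner product u_x = u_1 x_1 + ... + u_n x_n."""
    return sum(a * b for a, b in zip(u, x))


M = [[2, 1, 0], [1, 1, 1], [0, 3, 1]]   # coefficient matrix, columns xi, eta, zeta
x_new = [1, -2, 3]                      # the accented variables x'
u_old = [4, 0, -1]                      # the unaccented variables u

x_old = mat_vec(M, x_new)               # x = M x'
u_new = mat_vec(transpose(M), u_old)    # u' = M' u, equivalent to u = M'^{-1} u'

print(inner(u_old, x_old) == inner(u_new, x_new))
```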


4. Cogredient and Contragredient Symbols. 

The following convention will now prove to be useful when
we are concerned with multiple forms involving several sets of
variables $x, y, \ldots, u, v, \ldots$. We use the italics $a, b, c$ as symbols
associated with $x, y, z$, and Greek letters $\alpha, \beta, \gamma$ as symbols associated
with the contragredient variables $u, v, w$. Further, we write the
symbolic inner product involving $u$ and $\alpha$ as
$$\alpha_1u_1 + \alpha_2u_2 + \ldots + \alpha_nu_n = u_\alpha.$$

Thus the general multilinear form in these variables is symbolized
by $a_xb_yc_z \ldots u_\alpha v_\beta w_\gamma \ldots$.


Upper and Lower Indices. 

Non-symbolically the general coefficient of $f$ involving $p$ sets
of variables $x, y, z, \ldots$ has already been written as $a_{ijk\ldots}$ with
$p$ lower suffixes. But it has been found convenient to adopt
upper indices, as in the theory of determinants, when the contragredient
variables are involved, and to write
$$a_{ijk\ldots}^{rst\ldots}$$
for the typical coefficient of the multilinear form, involving $p$
lower and $q$ upper suffixes, answering to $p$ sets of variables
$x, y, z, \ldots$ and $q$ sets of $u, v, w, \ldots$.

Definition of Tensor of Rank r. — The set of coefficients
$a_{ijk\ldots}^{rst\ldots}$, with $p$ lower and $q$ upper suffixes, each taking all values
$1, 2, \ldots, n$ is called a tensor of orders $(p, q)$. It is sometimes
called a tensor of rank $r\ (= p + q)$.

This use of the word rank must not be confused with its use
in the theory of matrices. It has crept in through the work of
physicists in the theory of relativity. For greater clearness let
us call such a set a tensor of orders $(p, q)$.

Examples. —

$a_1x_1 + a_2x_2 + a_3x_3 = \Sigma\, a_ix_i = a_x$ is the ternary linear form. It has
orders $(1, 0)$.

$u_1\alpha_1 + u_2\alpha_2 + u_3\alpha_3 = \Sigma\, u_i\alpha_i = u_\alpha$ is also a ternary linear form, but
of orders $(0, 1)$.

$\Sigma\, a_i^j x_iu_j = a_xu_\alpha$ is a bilinear form with contragredient variables $x$, $u$.

$\Sigma\, a_{ij}x_iy_j = a_xb_y$ is a bilinear form with cogredient variables, and so
also is $\Sigma\, a^{ij}u_iv_j = u_\alpha v_\beta$.

$\Sigma\, a_{ij}^{kl}x_ix_ju_ku_l = a_x^2u_\alpha^2$ is a $(2, 2)$ form quadratic in two sets of
contragredient variables $x$ and $u$.

5. Equivalent Symbols. 

Any homogeneous polynomial in the coefficients of such forms
can be expressed symbolically by introducing equivalent symbols,
as at p. 179. So a second degree polynomial in the coefficients
of the $(1, 1)$ form would require two sets of symbols, say
$$a_i^j = a_i\alpha_j = a_i'\alpha_j',$$
so that, for example,
$$a_1^1a_2^2 = a_1\alpha_1a_2'\alpha_2' = a_1'\alpha_1'a_2\alpha_2.$$

The general form $a_xb_y \ldots u_\alpha v_\beta \ldots$ involving $p$ sets $x, y, \ldots$
and $q$ sets $u, v, \ldots$ has $p$ symbol sets $a, b, \ldots, k$ and $q$ symbol
sets $\alpha, \beta, \ldots, \kappa$. Equivalent symbols $a', b', \ldots, k', \alpha', \ldots, \kappa'$
whenever used are such that the substitution
$$\begin{pmatrix} a' & b' & \ldots & k' & \alpha' & \beta' & \ldots & \kappa' \\ a & b & \ldots & k & \alpha & \beta & \ldots & \kappa \end{pmatrix}$$
leaves the actual form unchanged.

6. Effect of Linear Transformation on the Symbols. 

We can now prove the following useful theorem. 

Under linear transformation, $M$, the symbols $a, b, c, \ldots$
associated with variables $x$ are cogredient with $u$, and symbols $\alpha, \beta,
\gamma, \ldots$ are cogredient with $x$.

For let $x \to x'$, $u \to u'$ denote the linear transformations.
Then the general form in variables $x, y, \ldots, u, \ldots$ retains
its same orders when expressed in terms of $x', y', \ldots, u', \ldots$.
If we use accents for symbols after transformation, we have in
the case of the $(1, 1)$ form
$$a_xu_\alpha = \Sigma\, a_i^j x_iu_j = \Sigma\, {a'}_i^j x_i'u_j' = a'_{x'}u'_{\alpha'}. \tag{4}$$
This is directly and explicitly secured by taking
$$a_x = a'_{x'}, \qquad u_\alpha = u'_{\alpha'} \tag{5}$$
in all cases, and defining the new symbols $a'$, $\alpha'$ by these relations.
But they are the characteristic conditions of contragredience.
Hence the transformation $a \to a'$ is contragredient to $x \to x'$,
and $\alpha \to \alpha'$ to $u \to u'$. But $u$ is contragredient to $x$. So $a$
is cogredient with $u$, and $\alpha$ with $x$. Similarly for the general
form.

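In greater detail, let
$$\left.\begin{aligned} x_1 &= \xi_1x_1' + \eta_1x_2' + \zeta_1x_3' \\ x_2 &= \xi_2x_1' + \eta_2x_2' + \zeta_2x_3' \\ x_3 &= \xi_3x_1' + \eta_3x_2' + \zeta_3x_3' \end{aligned}\right\} \tag{6}$$
where for shortness ternary variables are considered. Then the
contragredient transformation for $u \to u'$ is (p. 148, (8))
$$\left.\begin{aligned} u_1' &= \xi_1u_1 + \xi_2u_2 + \xi_3u_3 = u_\xi \\ u_2' &= \eta_1u_1 + \eta_2u_2 + \eta_3u_3 = u_\eta \\ u_3' &= \zeta_1u_1 + \zeta_2u_2 + \zeta_3u_3 = u_\zeta \end{aligned}\right\} \tag{7}$$
Consequently, for the symbols $a$, which are cogredient with $u$,
$$a_1' = a_\xi, \quad a_2' = a_\eta, \quad a_3' = a_\zeta; \tag{8}$$
while the solution of (6) gives
$$\frac{x_1'}{(x\eta\zeta)} = \frac{x_2'}{(\xi x\zeta)} = \frac{x_3'}{(\xi\eta x)} = \frac{1}{(\xi\eta\zeta)}, \tag{9}$$
so that the symbols $\alpha$, cogredient with $x$, satisfy analogous
equations
$$\frac{\alpha_1'}{(\alpha\eta\zeta)} = \frac{\alpha_2'}{(\xi\alpha\zeta)} = \frac{\alpha_3'}{(\xi\eta\alpha)} = \frac{1}{(\xi\eta\zeta)}. \tag{10}$$
Equations (8) hold for every symbol $a, b, c, \ldots$, and (10) for every
$\alpha, \beta, \gamma, \ldots$. They also typify the general case when $n$ columns
$\xi, \eta, \ldots, \omega$ occur in (6) and $n$-rowed determinants in the denominators
of (10). We can now prove the Fundamental Theorem.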

7. Fundamental Theorem for the General Multilinear Form. 

Every rational integral invariant of multilinear ground forms
whose symbols are $a, b, c, \ldots, \alpha, \beta, \gamma, \ldots$ is expressible as
symbolic polynomials, the factors of whose terms are of three types
$$a_\alpha, \quad (abc \ldots m), \quad (\alpha\beta\gamma \ldots \mu),$$
together with numerical coefficients.

The proof follows the lines of previous simpler cases, but
requires two more preliminary lemmas which for clearness will
be explained for the ternary case.

In the preceding formulae (7), (8), (10), the coefficients
$\xi, \eta, \zeta$ appear along with symbols $\alpha, \beta, \gamma, \ldots$. Let us now
consider $a_\xi, a_\eta, a_\zeta$ all to be of the same type as $a_\alpha$, and $(\alpha\beta\xi)$,
$(\alpha\xi\eta)$, $(\alpha\eta\zeta)$, &c., all to be of the same type as $(\alpha\beta\gamma)$. So we
provisionally use inclusive symbols $\theta, \phi, \psi$ which may take the
values
$$\xi, \eta, \zeta, \alpha, \beta, \ldots.$$

Lemma I. — The effect of the Cayley operator $\Omega = \left|\dfrac{\partial}{\partial\xi_1}\ \dfrac{\partial}{\partial\eta_2}\ \dfrac{\partial}{\partial\zeta_3}\right|$
upon a product of factors of types $a_\theta$, $(abc)$, $(\theta\phi\psi)$ which includes
among its symbols one $\xi$, one $\eta$, and one $\zeta$ is another product of
the same types, but excluding $\xi$, $\eta$, and $\zeta$.

Such a product is typified by the following cases:

(i) $Q = (\xi\eta\zeta)N$,

(ii) $Q = (\xi\eta\theta)(\zeta\phi\psi)N$,

(iii) $Q = (\xi\eta\theta)a_\zeta N$,

(iv) $Q = (\xi\theta\phi)(\eta\theta'\phi')(\zeta\theta''\phi'')N$,

(v) $Q = (\xi\theta\phi)(\eta\theta'\phi')a_\zeta N$,

(vi) $Q = (\xi\theta\phi)a_\eta b_\zeta N$,

(vii) $Q = a_\xi b_\eta c_\zeta N$.

Here $N$ denotes factors not containing $\xi, \eta, \zeta$, and variations
depending merely on derangement of the order $\xi, \eta, \zeta$ are excluded.

Now apply the six-termed determinantal permutation $\dot\xi, \dot\eta, \dot\zeta$ to
each of these cases. Then by the fundamental identity II,
p. 48 (cf. §8, p. 41), the result in each of the cases (i) to (vi)
is a derangement of the symbols $\xi, \eta, \zeta, \theta$ and possibly $\phi$, yielding
$(\xi\eta\zeta)$ as factor. For example, in (vi) it gives
$$(\xi\eta\zeta)(a_\theta b_\phi - a_\phi b_\theta)N,$$
while in (vii) the result is $(\xi\eta\zeta)(abc)N$. Hence in all cases the
process leaves the types unaffected and produces a factor $(\xi\eta\zeta)$.
But by §5, p. 119 this operation is equivalent to $(\xi\eta\zeta)\,\Omega$.
Hence dividing throughout by $(\xi\eta\zeta)$ the lemma is proved.
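
Cases (vi) and (vii) lend themselves to a direct numerical check. The Python sketch below is an illustration with assumed helper names and sample integer vectors. It applies the six-termed determinantal permutation of $\xi, \eta, \zeta$ to $a_\xi b_\eta c_\zeta$ and to $(\xi\theta\phi)a_\eta b_\zeta$, and compares the results with $(\xi\eta\zeta)(abc)$ and $(\xi\eta\zeta)(a_\theta b_\phi - a_\phi b_\theta)$ respectively.

```python
from itertools import permutations


def dot(a, t):
    """Inner product such as a_xi = a_1 xi_1 + a_2 xi_2 + a_3 xi_3."""
    return sum(p * q for p, q in zip(a, t))


def det3(u, v, w):
    """Three-rowed bracket factor (uvw)."""
    return (u[0] * (v[1] * w[2] - v[2] * w[1])
            - u[1] * (v[0] * w[2] - v[2] * w[0])
            + u[2] * (v[0] * w[1] - v[1] * w[0]))


def sign(perm):
    """Sign of a permutation given as a tuple, by counting inversions."""
    s = 1
    for i in range(len(perm)):
        for j in range(i + 1, len(perm)):
            if perm[i] > perm[j]:
                s = -s
    return s


def det_perm(F, xi, eta, zeta):
    """Signed sum of F over the six arrangements of (xi, eta, zeta)."""
    vecs = (xi, eta, zeta)
    return sum(sign(p) * F(vecs[p[0]], vecs[p[1]], vecs[p[2]])
               for p in permutations(range(3)))


a, b, c = [1, 2, 0], [0, 1, 3], [2, -1, 1]
xi, eta, zeta = [2, 1, 0], [1, 1, 3], [0, 1, 1]
theta, phi = [1, 0, 2], [3, 1, -1]

# Case (vii): a_xi b_eta c_zeta
case_vii = det_perm(lambda p, q, r: dot(a, p) * dot(b, q) * dot(c, r),
                    xi, eta, zeta)
print(case_vii == det3(xi, eta, zeta) * det3(a, b, c))

# Case (vi): (xi theta phi) a_eta b_zeta
case_vi = det_perm(lambda p, q, r: det3(p, theta, phi) * dot(a, q) * dot(b, r),
                   xi, eta, zeta)
print(case_vi == det3(xi, eta, zeta)
      * (dot(a, theta) * dot(b, phi) - dot(a, phi) * dot(b, theta)))
```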


Lemma II. — The effect of the operator $\Omega$ on such a product
involving several symbols $\xi, \eta, \zeta$ is a sum of such products of the
same type but involving one $\xi$ fewer, one $\eta$ fewer, and one $\zeta$ fewer.

Since the operator $\Omega$ is linear in $\dfrac{\partial}{\partial\xi_1}, \dfrac{\partial}{\partial\xi_2}, \dfrac{\partial}{\partial\xi_3}$ it operates on
a function involving $m$ factors, each linear in $\xi$, precisely as the
ordinary rule for differentiating a product of $m$ functions of one
variable (cf. (33), p. 122)
$$\frac{d}{dx}(XY) = \frac{dX}{dx}\,Y + X\,\frac{dY}{dx}.$$
In fact it gives $m$ terms, in each of which only one factor involving
$\xi$ undergoes operation. Likewise for $\eta$ and $\zeta$. Thus each such term
behaves as in Lemma I, so that the second lemma is proved.

Proof of Fundamental Theorem. —

Let $I(A)$ denote a polynomial invariant in several sets of
coefficients, typified by $a_{ij\ldots}^{rs\ldots}$. Then if accents denote the corresponding
new coefficients after transformation we have by
hypothesis
$$I(A') = \phi(\xi_1, \ldots, \omega_n)\, I(A).$$
Although two contragredient transformations $x \to x'$, $u \to u'$
are now involved, they lead to one type of determinant $|M|$;
and the argument already used in §2, p. 169, shows that
$\phi(\xi_1, \ldots, \omega_n)$ can only be an integral power of $|M|$. This
power may be zero or even negative.

For instance, the ternary bilinear form $a_xu_\alpha$ has an absolute
invariant $a_\alpha$, since
$$a'_{\alpha'} = \Sigma\, a_i'\alpha_i' = \frac{a_\xi(\alpha\eta\zeta) + a_\eta(\xi\alpha\zeta) + a_\zeta(\xi\eta\alpha)}{(\xi\eta\zeta)} \quad \text{by (8) and (10),}$$
$$= a_\alpha$$
after using the fundamental identity. Here the index of $(\xi\eta\zeta)$
in $\phi(\xi_1, \ldots, \omega_n)$ is zero.
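
Non-symbolically the absolute invariant $a_\alpha$ of the form $\Sigma\, a_i^j x_iu_j$ is the sum $a_1^1 + a_2^2 + a_3^3$ of its diagonal coefficients. The Python sketch below checks its invariance with exact rational arithmetic. It is an illustration only: the matrix layout (entry `A[i][j]` for $a_i^j$), the helper names and the sample values are assumptions, and the transformed coefficients are obtained from $x = Mx'$, $u = M'^{-1}u'$.

```python
from fractions import Fraction


def transpose(M):
    return [list(col) for col in zip(*M)]


def matmul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]


def mat_vec(M, v):
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def inverse(M):
    """Exact inverse by Gauss-Jordan elimination (M must be non-singular)."""
    n = len(M)
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(M)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pv = aug[col][col]
        aug[col] = [v / pv for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                f = aug[r][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def bilinear(A, x, u):
    """Value of the (1, 1) form: sum of A[i][j] * x_i * u_j."""
    n = len(A)
    return sum(A[i][j] * x[i] * u[j] for i in range(n) for j in range(n))


def trace(A):
    return sum(A[i][i] for i in range(len(A)))


A = [[1, 2, 0], [3, -1, 4], [0, 5, 2]]
M = [[2, 1, 0], [1, 1, 1], [0, 3, 1]]
Mt = transpose(M)
Mt_inv = inverse(Mt)

# x = M x' and u = M'^{-1} u' turn x^T A u into x'^T (M' A M'^{-1}) u'.
A_new = matmul(matmul(Mt, A), Mt_inv)

x_new, u_new = [1, -2, 3], [4, 0, -1]
x_old, u_old = mat_vec(M, x_new), mat_vec(Mt_inv, u_new)
print(bilinear(A, x_old, u_old) == bilinear(A_new, x_new, u_new))
print(trace(A_new) == trace(A))
```

The transformed coefficients are in general fractions with denominator $|M|$, yet their diagonal sum is unchanged, in agreement with an index of zero.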

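We therefore take in the ternary case $I(A') = (\xi\eta\zeta)^w\, I(A)$
where $w$ is an integer, positive, zero, or negative. Let this be
thrown into symbolic form.
$$I(a_1', \ldots, b_1', \ldots, \alpha_1', \ldots) = (\xi\eta\zeta)^w\, I(a_1, \ldots, b_1, \ldots, \alpha_1, \ldots).$$
Then by (8) and (10) the left-hand polynomial can be written
$$I\left(a_\xi, a_\eta, a_\zeta, b_\xi, \ldots, \frac{(\alpha\eta\zeta)}{(\xi\eta\zeta)}, \frac{(\xi\alpha\zeta)}{(\xi\eta\zeta)}, \frac{(\xi\eta\alpha)}{(\xi\eta\zeta)}, \ldots\right)$$
which is a polynomial in its arguments $a_\xi, a_\eta, \ldots, (\alpha\eta\zeta), \ldots$
together with a common denominator, say $(\xi\eta\zeta)^p$, since the original
function $I(A')$ is homogeneous originally in each set of coefficients
and therefore finally in each set of symbols $\alpha$ which it requires.
Multiplying through by $(\xi\eta\zeta)^p$ we have, as our identity,
$$I(a_\xi, a_\eta, a_\zeta, b_\xi, \ldots, (\alpha\eta\zeta), (\xi\alpha\zeta), (\xi\eta\alpha), \ldots) = (\xi\eta\zeta)^{w+p}\, I(a_1, \ldots, \alpha_1, \ldots).$$
The degree in $\xi_1, \xi_2, \xi_3$ is $w + p$ throughout; while $\xi$ only enters
the left member by way of factors of types
$$a_\xi, \ b_\xi, \ \ldots, \ (\xi\alpha\zeta), \ (\xi\eta\alpha).$$
Since at least one such factor arises if merely a single actual
coefficient $A$ occurs in $I(A)$, it follows that $w + p$ must be a
positive integer.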



We now operate $w + p$ times in succession with the Cayley
operator
$$\Omega = \left|\frac{\partial}{\partial\xi_1}\ \frac{\partial}{\partial\eta_2}\ \frac{\partial}{\partial\zeta_3}\right|$$
and obtain, by Lemma II, an aggregate of the desired type on the
left, with a non-zero numerical multiple of $I(A)$ on the right.
For the process removes one $\xi$, $\eta$, and $\zeta$ at each stage, thereby
freeing the left side entirely of $\xi, \eta, \zeta$ because it does so on the
right, at the same time preserving the desired types of factors
on the left. Since exactly the same methods hold for the
general as for this ternary case, this proves the theorem.


8. Covariants, Contravariants, and Mixed Concomitants.

Historically the functions which satisfy the typical invariant
property
$$I(A') = |M|^w\, I(A) \tag{11}$$
have been classified in the following manner:

(i) Invariants,

(ii) Covariants,

(iii) Contravariants,

(iv) Mixed concomitants,

the last three classes being known collectively as Concomitants.

Functions (i) involve coefficients of ground forms only: (ii)
involve variables $x_1, \ldots, x_n$ besides; (iii) involve the contragredient
variables $u_1, \ldots, u_n$ instead of $x$; (iv) involve any
variables that may exist. Thus if
$$f_1 = a_x^2, \quad f_2 = b_x^2, \quad f_3 = c_x^2$$
are three ternary quadratics, we have as instances of these four
types of concomitant
$$\left.\begin{array}{ll} \text{(i)} & (abc)^2 \\ \text{(ii)} & (abc)\,a_xb_xc_x \\ \text{(iii)} & (abu)(bcu)(cau) \\ \text{(iv)} & (abu)\,a_xb_x \\ \text{(iva)} & u_x \end{array}\right\} \tag{12}$$


For by the general fundamental theorem, concomitants are composed
of symbolic factors $a_\alpha$, $(abc)$, $(\alpha\beta\gamma)$, while by (5), p. 202, u
behaves like symbols a, b, c and x like α, β, γ. Hence (abu),
(abc) are possible factors of a concomitant. Also exactly two
a's, two b's, two c's go to form expressions (12), so that they are
actual concomitants of the quadratics.

Here (ii) is the cubic covariant of three ternary quadratics
which we have already seen to be their Jacobian (§1, p. 182):
(iii) is a cubic contravariant, because on expansion it is a
cubic in u: (iv) is a mixed form of orders (1, 2), while
(iva) is sometimes called the absolute concomitant of the field.

This classification, however, does not go very deep, because
all concomitants can be treated as invariants, merely by adjoining
suitable ground forms. Since, when x → x', u → u',

$$u_1x_1 + \cdots + u_nx_n = u_1'x_1' + \cdots + u_n'x_n',$$

we may regard the variables $u_1, \ldots, u_n$ as coefficients of a certain
linear form $a_x$ where $u_i = a_i$. Correlatively the variables x
may be regarded as coefficients of a certain linear form $u_\alpha$ where
$x_i = \alpha_i$. And, in fact, the problem of finding covariants of a ground
form f is indistinguishable from that of finding invariants of two
ground forms, f and a linear form $u_x$, treating u as the variable
in the latter (cf. p. 145).

If we treat u, v, w, . . . , x, y, z, . . . as such coefficient sets, and
use U, X for variables, we reduce all concomitants involving
ground forms f together with u, v, w, . . . , x, y, z, . . . to invariants
of forms f together with linear forms $u_X, \ldots, U_x, \ldots$

Hence the fundamental theorem covers the case of all con- 
comitants of all four types (i), (ii), (iii), and (iv). 


Example.

The polar process $\sum y_i \dfrac{\partial}{\partial x_i}$ is a particular case of the Aronhold
process (p. 140). For it is the latter applied to a linear form $U_x$.
The same remark is true of any polar process . . . , &c. Hence
polarization is an invariant process (p. 141).


9. Convolution and Resolution.

The absolute concomitant (iva) can take various forms since
by the methods of §8, p. 86, x can be resolved into n − 1
components v, w, . . . , t or correlatively u can be resolved into
n − 1 such as y, . . . , z.




For ternary forms if x, y, z are three such points, and u = yz,
v = zx, w = xy are three lines forming the sides of their triangle,
then the expressions

$$u_x = (xyz), \qquad (uvw) = (yz \cdot zx \cdot xy) = (xyz)^2$$

are absolute concomitants. Similarly for higher fields. This
process of replacing one by many variables is called resolution,
the converse being composition or convolution.
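The resolution of a line into two points can be checked numerically. The following is a minimal sketch, assuming points and lines are stored as integer triples and that the line yz is formed from the 2×2 minors of its two points; the helper names `bracket`, `join`, and `inner` are hypothetical and not part of the text.

```python
# Hypothetical helpers; points and lines are plain 3-tuples of integers.

def bracket(x, y, z):
    """Bracket factor (xyz): the 3x3 determinant with rows x, y, z."""
    return (x[0] * (y[1] * z[2] - y[2] * z[1])
            - x[1] * (y[0] * z[2] - y[2] * z[0])
            + x[2] * (y[0] * z[1] - y[1] * z[0]))

def join(p, q):
    """Line pq through two points: the 2x2 minors of the rows p, q."""
    return (p[1] * q[2] - p[2] * q[1],
            p[2] * q[0] - p[0] * q[2],
            p[0] * q[1] - p[1] * q[0])

def inner(u, x):
    """Inner product u_x = u1*x1 + u2*x2 + u3*x3."""
    return sum(ui * xi for ui, xi in zip(u, x))

x, y, z = (1, 2, 3), (0, 1, 4), (5, 6, 1)
u, v, w = join(y, z), join(z, x), join(x, y)

# Resolving u into y, z turns u_x into the bracket (xyz).
assert inner(u, x) == bracket(x, y, z)
# The three sides of the triangle give (uvw) = (xyz)^2.
assert bracket(u, v, w) == bracket(x, y, z) ** 2
```

Integer arithmetic keeps the check exact; any other triple of points may be substituted.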

Exactly the same processes apply to the symbols; one a
can be replaced by n − 1 such as α', α'', . . . , and one α by
n − 1 such as a', a'', . . . .

Example.

The ternary quadric $a_x^2$ could be symbolized by $(\alpha'\alpha''x)^2$,
where $a_1 = \alpha_2'\alpha_3'' - \alpha_3'\alpha_2''$, &c.

10. The Fundamental Theorem for the General Case. 

In order to cover all cases contemplated in the classification 
of (12) above, we must consider a possible ground form with 
several sets of variables and perhaps distinct fields of linear 
transformation. The case of absolutely distinct fields will be 
dealt with later in §4, p. 240, but to round off the present
investigation we must contemplate the ground form which contains
rth compound co-ordinate sets, r = 2, 3, . . . , n − 2, the case
r = n − 1 having already been included.

As an example, consider the multilinear form in three
variables x, y, z

$$f = a_x b_y c_z = \sum A_{ijk}\,x_i y_j z_k.$$

Suppose that it is not the most general polynomial in y and z,
but changes sign when y and z are interchanged: namely the
coefficients $A_{ijk}$ obey the law

$$A_{ijk} = -A_{ikj}$$

for all values of j and k. Then after interchange we write

$$-f = a_x b_z c_y = \sum A_{ijk}\,x_i z_j y_k,$$

whence by subtraction

$$2f = a_x(b_y c_z - b_z c_y) = \sum A_{ijk}\,x_i\,(y_j z_k - y_k z_j)$$

$$= a_x\,(bc\,|\,yz) = \sum A_{ijk}\,x_i\,(yz)_{jk}.$$
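The passage from the trilinear form to a bilinear form in x and the compound yz can be illustrated with actual coefficients. The sketch below is only an illustration under stated assumptions: the field is taken to be ternary, the coefficients are random integers, and the function names are hypothetical.

```python
import random

N = 3  # ternary field, chosen only for the illustration

def alternating_coefficients(seed=0):
    """Random integers A[i][j][k] obeying the law A_ijk = -A_ikj."""
    rng = random.Random(seed)
    A = [[[0] * N for _ in range(N)] for _ in range(N)]
    for i in range(N):
        for j in range(N):
            for k in range(j + 1, N):
                A[i][j][k] = rng.randint(-5, 5)
                A[i][k][j] = -A[i][j][k]
    return A

def trilinear(A, x, y, z):
    """f = sum of A_ijk x_i y_j z_k over all suffixes."""
    return sum(A[i][j][k] * x[i] * y[j] * z[k]
               for i in range(N) for j in range(N) for k in range(N))

def bilinear_in_compound(A, x, y, z):
    """Sum of A_ijk x_i (yz)_jk, where (yz)_jk = y_j z_k - y_k z_j."""
    return sum(A[i][j][k] * x[i] * (y[j] * z[k] - y[k] * z[j])
               for i in range(N) for j in range(N) for k in range(N))

A = alternating_coefficients()
x, y, z = (1, -2, 3), (4, 0, -1), (2, 5, 7)

# f changes sign when y and z are interchanged ...
assert trilinear(A, x, z, y) == -trilinear(A, x, y, z)
# ... and the sum over the second compound yz reproduces 2f.
assert bilinear_in_compound(A, x, y, z) == 2 * trilinear(A, x, y, z)
```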




This is now a bilinear form in two sets, x and a second compound
$\pi_2 = yz$. We note that f is symbolized by two symbolic linear
factors, one for x as before, and the other, (bc | yz), a type already
familiar through the theorem of corresponding matrices.
Furthermore, if

        jk,   l . . . m

are algebraic complements among n suffixes jkl . . . m deranged
from 123 . . . n, and if variables v, . . . , w, contragredient to y, are
introduced, we have by the principle of duality

        (bc | yz) = (bcv . . . w).                      (13)

But this last is an ordinary bracket factor of n symbols, which 
we can also write as 

(boo . . , w)^ . . . • ( 14 ) 

where is the n-rowed matrix of currency (n — 2). 

More generally, by the same reasoning, a form

$$\phi = b_y c_z d_{y'} e_{z'} \cdots$$

alternating in y, z, also in y', z', also in y'', z'', and so on for q
pairs, is symbolized more explicitly by

$$(bc\,|\,yz)\,(de\,|\,y'z') \cdots = (bc\,|\,\pi_2)\,(de\,|\,\pi_2') \cdots$$

and if these second compounds $\pi_2, \pi_2', \ldots$ happen to be equal,
the form is

$$(bc\,|\,\pi_2)\,(de\,|\,\pi_2) \cdots$$

of degree q in $\pi_2$.

What has been said of these second compounds would also
apply to other values of r. In this way we are led to consider
such a ground form as

$$F = a_x a'_y \cdots (bc\,|\,\pi_2)(b'c'\,|\,\pi_2') \cdots (def\,|\,\pi_3)(d'e'f'\,|\,\pi_3') \cdots \qquad (15)$$

with $q_1$ factors like $a_x$, $q_2$ involving second, $q_3$ third, . . . , and
finally $q_{n-1}$, (n − 1)th compounds of type u. When x = y = z = . . . ,
$\pi_2 = \pi_2'$, . . . , $\pi_r = \pi_r'$, . . . , this is called a mixed ground form
of orders

$$(q_1, q_2, \ldots, q_{n-1}), \qquad q_i \geq 0, \qquad (16)$$



in the variables $x, \pi_2, \ldots, \pi_{n-1} = u$. At a later stage it will be
seen, as Clebsch was the first to prove, that this is the most
general type of ground form necessarily occurring in the invariant
theory of one set of variables and all its compounds. At present
we have merely found that such a form may arise. With this
theorem of Clebsch ultimately in view to give significance to
the form F, once more we adapt the Fundamental Theorem to
include F as a possible ground form.

11. Proof of the Fundamental Theorem. 

To effect this, it is only necessary to show that the new type
of factor in F, due to the rth compound (r = 2, 3, . . . , n − 2)
leads to the same general final result as before, namely an
aggregate of types

$$a_\alpha, \qquad (abc \ldots m), \qquad (\alpha\beta\gamma \ldots \mu).$$

The typical new type of factor in F is

$$(A_r\,|\,\pi_r) = (a_1a_2 \ldots a_r\,|\,\pi_r) = (a_1a_2 \ldots a_r\,|\,yz \ldots t), \qquad (r = 2, 3, \ldots, n-2),$$

where each $a_i$ is a symbol cogredient with a or u, and where
y, z, . . . , t are r variables cogredient with x. We denote the
expanded form of this type of factor by

$$\Omega_{A_r}\,a_{1y}\,a_{2z} \cdots a_{rt},$$

where $\Omega_{A_r}$ is the determinantal permutation of r! terms previously
denoted by

$$\dot a_1 \dot a_2 \ldots \dot a_r.$$

Let this be done for each such factor, where r = 2, 3, . . . , n − 2,
so that F is now expressed symbolically in terms of symbols and
variables a, u, α, x as before, except that certain groups of
symbols a undergo determinantal permutation.

For instance, if A denotes aa' and B, bb', the form

$$(A\,|\,yz)\,(B\,|\,xt) = (aa'\,|\,yz)\,(bb'\,|\,xt)$$

is now written

$$\Omega_A\,\Omega_B\;a_y a'_z\,b_x b'_t.$$


It consists of four terms due to simultaneous derangement of 
aa' and of 66'. Clearly we can write it in many ways 

and so on. 

Each operating symbol Ω now denotes a group implicitly
convolved (§10, p. 46) in the operand, as shown by the suffix
of Ω.

If there are ν such groups A, B, . . . , K in the form F we
denote the expanded form of F by

$$F = \Omega_A\,\Omega_B \cdots \Omega_K\;a_{1y} \cdots G,$$

where G denotes all other factors of F not so affected.

Now any actual coefficient of F after linear transformation
is obtained symbolically by substituting one or other
of ξ, . . . , ω for each variable x, y, . . . occurring in F. Hence
it only differs from the result in the previous case by having ν
groups of symbols implicitly convolved.

Since these symbols $a_1, \ldots, a_r$ do not include ξ, . . . , ω,
the Corollary IV, §11, p. 49, applies when we permute ξ, . . . , ω,
as is the case when the Cayley operator Ω acts upon a supposed
invariant. Thus we can perform each step of the previous proof
of the Fundamental Theorem, and obtain the same result as
before, only modified in this respect, that

In the final aggregate of types

$$a_\alpha, \qquad (abc \ldots m), \qquad (\alpha\beta\gamma \ldots \mu)$$

every group of symbols $a_1, a_2, \ldots, a_r$ which were convolved in the
original ground form, still preserve this property implicitly in the
symbolic form of the invariant.

With this proviso the theorem has now been completely 
established for mixed ground forms in any number of compound 
variables for the field of order n. 


Example . — 

In quaternary forms three types of variable are used

        uvw = x,   uv = p = xy,   u = xyz.

The form

$$F = \sum A_{ij}\,p_{ij} = (aa'p) = (aa'uv)$$

is called a linear complex. If equivalent symbols are used, we
have

        (aa'uv) = (bb'uv) = (cc'uv) = (dd'uv).

Hence a possible invariant of weight two has the term
(aa'bc) (b'c'dd'), which gives rise to a four-term series due to
the couple of alternatives b, b'; −b', b and of c, c'; −c', c.
According to the Fundamental Theorem this series would be

$$-I = \Omega_B\,\Omega_C\,(aa'bc)\,(b'c'dd'),$$

where $\Omega_B\Omega_C$ denotes the derangement of bb', and of cc'. Such a
notation is preferable to the dot notation when several independent
derangements proceed simultaneously.


EXAMPLES 1 

1. If the above invariant is written as

$$I = -\Omega_B\,\Omega_C\,(b'aa'c)\,(c'dd'b) = -\{BACD\}$$

where A denotes aa', and B, bb', and so on, prove that

        {BACD} = {CDBA} = {BDCA} = {CABD}.

2. Using identity I, p. 45, to convolve cc' explicitly in I, prove that

        {BACD} + {BCAD} = 2(AC) (DB),

where (AC) = (aa'cc'). Hence prove that

        I = (AB) (CD) + (AC) (BD) − (AD) (BC).

This procedure renders each of the four convolutions A, B, C, D explicit.
Originally two were implicit.

3. Let $J = \Omega_C\,(abb'c)\,(c'dd'd'') = (aBC\delta)$, say. Then J is a
simultaneous invariant of a plane a, two linear complexes B and C, and a
point δ. Here a is cogredient with u, B and C each with p, and δ with x.
Prove the identity

        (aBCδ) + (aCBδ) + (BC) a_δ = 0.

4. An invariant exists, linear in the coefficients of three linear
complexes and two planes.

Let $K = \Omega_C\,(abb'c)\,(c'dd'e) = (aBCDe)$, where B, C, D refer to three
complexes, and a, e to two planes. Prove that (aBCDe) = −(eDCBa),
and that

        (aBCDe) + (aCBDe) + (BC) (aDe) = 0.

5. Prove that (uBCDu), which is a special case of the expression K, is
an alternating function of B, C, D. Thus (uBCDu) = −(uCBDu)
= (uCDBu) = &c.

6. Discuss the corresponding dual invariant

$$K' = \Omega_B\,\Omega_D\,(\alpha b)\,(b'cc'd)\,(d'\beta)$$

of two points α, β and three complexes B, C, D.

1 For references see §13, p. 330. 



CHAPTER XIII 


Symbolic Methods of Reduction

1. The Fundamental Identities. 

The First Fundamental Theorem has provided a uniform in 
which any concomitant can be dressed, together with a means 
of constructing as many as we like. The next stage in the theory 
is to learn how such symbolic forms behave, to simplify or reduce 
or transform them. Like ordinary numbers and like matrices, 
they have their fundamental properties or rules of combination. 
For binary forms these run as follows: 

(i) $(ab) = -(ba)$,

(ii) $(bc)\,a_x + (ca)\,b_x + (ab)\,c_x = 0$,

(iii) $(bc)(ad) + (ca)(bd) + (ab)(cd) = 0$,

(iv) $a_x b_y - a_y b_x = (ab)(xy)$.

(v) The interchange of equivalent symbols. 
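Since the symbols behave like ordinary numbers, identities (i) to (iv) can be tested by giving each symbol and each variable a pair of integer components. The following is a minimal sketch; the helper names `br`, `lin`, and `residuals` are hypothetical.

```python
def br(a, b):
    """Bracket factor (ab) = a1*b2 - a2*b1."""
    return a[0] * b[1] - a[1] * b[0]

def lin(a, x):
    """Linear factor a_x = a1*x1 + a2*x2."""
    return a[0] * x[0] + a[1] * x[1]

def residuals(a, b, c, d, x, y):
    """Left side minus right side of identities (i)-(iv); all should be 0."""
    r1 = br(a, b) + br(b, a)
    r2 = br(b, c) * lin(a, x) + br(c, a) * lin(b, x) + br(a, b) * lin(c, x)
    r3 = br(b, c) * br(a, d) + br(c, a) * br(b, d) + br(a, b) * br(c, d)
    r4 = lin(a, x) * lin(b, y) - lin(a, y) * lin(b, x) - br(a, b) * br(x, y)
    return (r1, r2, r3, r4)

assert residuals((1, 2), (3, -1), (4, 5), (-2, 7), (6, 1), (2, -3)) == (0, 0, 0, 0)
```

Identity (v) is of a different kind: it concerns the actual coefficients which the symbols stand for, and so is not a relation among the symbol components themselves.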

All these properties have been established and illustrated. It
is perhaps the last which presents the greatest novelties and
apparent drawbacks of the symbolic methods, because it leads
to alternative expressions for one and the same invariant,
without suggesting which of them is to be taken as the simpler.
Thus these three expressions

$$(ab)(ac)\,b_x c_x, \qquad \tfrac{1}{2}(ab)^2 c_x^2, \qquad (ba)(bc)\,a_x c_x$$

are all equal to $(a_0a_2 - a_1^2) \times f$ for the binary quadratic

$$f = a_0x_1^2 + 2a_1x_1x_2 + a_2x_2^2 = a_x^2 = b_x^2 = c_x^2.$$

Again, as in the theory of determinants, very substantial functions
such as (bc) (ca) (ab) can vanish identically, but for a new reason,
owing to the operation (v). For if a, b are equivalent, then on
interchange

$$(bc)(ca)(ab) = (ac)(cb)(ba),$$

and by three applications of (i), this last is −(ca) (bc) (ab), whence
2(bc) (ca) (ab) = 0 or (bc) (ca) (ab) = 0. Cf. §3, p. 18.

For forms in n variables analogous properties hold, but owing
to the two types of symbol a and α with contragredient behaviour,
there are now the following laws:

(i) $(ab \ldots m) = -(ba \ldots m) = \ldots$,

    $(\alpha\beta \ldots \mu) = -(\beta\alpha \ldots \mu) = \ldots$,

(ii) $\Pi_1 \equiv (ab \ldots m)\,n_\alpha - (nb \ldots m)\,a_\alpha + \cdots + (-)^n (nab \ldots)\,m_\alpha \equiv 0$,

     $\Pi_1' \equiv (\alpha\beta \ldots \mu)\,a_\nu - (\nu\beta \ldots \mu)\,a_\alpha + \cdots + (-)^n (\nu\alpha\beta \ldots)\,a_\mu \equiv 0$,

(iii) $\Pi_2 \equiv (ab \ldots m)(uv \ldots w) - (ub \ldots m)(av \ldots w) + \cdots \equiv 0$,

      $\Pi_2' \equiv (\alpha\beta \ldots \mu)(xy \ldots z) - (x\beta \ldots \mu)(\alpha y \ldots z) + \cdots \equiv 0$,

(iv) $\Pi_3 \equiv (ab \ldots m)(\alpha\beta \ldots \mu) - \sum \pm\, a_\alpha b_\beta \cdots m_\mu \equiv 0$,

(v) Interchange of equivalent symbols a with b or α with β.

Here $\Pi_1$ and $\Pi_1'$ are dual identities, and so also are $\Pi_2$ and $\Pi_2'$.
They follow from (IV), §9, p. 46, while $\Pi_3$ is a statement of the
product theorem of two determinants.
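For the ternary field two of these laws can be checked directly with integer components. The sketch below assumes n = 3, writes the first identity in the explicit four-term form (abc)n_α − (nbc)a_α + (nac)b_α − (nab)c_α = 0, which is the expansion by its first column of a four-rowed determinant with two proportional columns, and treats the product theorem as the determinant of the array of inner products. The function names are hypothetical.

```python
def det3(r0, r1, r2):
    """Three-rowed determinant with the given rows."""
    return (r0[0] * (r1[1] * r2[2] - r1[2] * r2[1])
            - r0[1] * (r1[0] * r2[2] - r1[2] * r2[0])
            + r0[2] * (r1[0] * r2[1] - r1[1] * r2[0]))

def inner(a, alpha):
    """Inner product a_alpha = a1*alpha1 + a2*alpha2 + a3*alpha3."""
    return sum(p * q for p, q in zip(a, alpha))

def pi_1(a, b, c, n, alpha):
    """Ternary case: four symbols a, b, c, n and one contragredient symbol."""
    return (det3(a, b, c) * inner(n, alpha)
            - det3(n, b, c) * inner(a, alpha)
            + det3(n, a, c) * inner(b, alpha)
            - det3(n, a, b) * inner(c, alpha))

def pi_3(a, b, c, alpha, beta, gamma):
    """(abc)(alpha beta gamma) minus the determinant of inner products."""
    rows = [[inner(s, t) for t in (alpha, beta, gamma)] for s in (a, b, c)]
    return det3(a, b, c) * det3(alpha, beta, gamma) - det3(*rows)

a, b, c, n = (1, 2, 0), (3, -1, 4), (2, 2, 5), (-1, 6, 3)
alpha, beta, gamma = (2, 1, 1), (0, 3, -2), (4, -1, 1)

assert pi_1(a, b, c, n, alpha) == 0
assert pi_3(a, b, c, alpha, beta, gamma) == 0
```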

2. The Second Fundamental Theorem. 

The question now arises, granted that these identities teach 
us something of the properties of symbolic forms, do they leave 
any gaps, do any properties escape? The answer is given by a 
very remarkable theorem usually called the Second Fundamental 
Theorem, which was originally proved for binary forms by 
Gordan.¹ Later a proof when n = 3 was given by E. Study²
and for the general case by E. Pascal,³ and recently for compound
co-ordinates and for restricted transformations by R.
Weitzenböck.⁴

¹ Cf. Gordan, Invariantentheorie, Bd. II, §117: Pascal, Battaglini, 26 (1888).
Grace and Young, Algebra of Invariants (1903), p. 368.

² Study, Methoden . . . (Leipzig, 1889), p. 75 and p. 204.

³ Pascal, Mem. del R. Acc. dei Lincei, V, 4a (1888).

⁴ Wiener Berichte, 122 (1913). Cf. Invariantentheorie, pp. 98-113, by the
same author.




The theorem states: 

Every identity satisfied by invariants (concomitants) can
ultimately be expressed by the fundamental identities together with the
principle of the interchange of equivalent symbols and the laws
of ordinary algebra for the combination of the three elementary
types $a_\alpha$, $(abc \ldots m)$, $(\alpha\beta\gamma \ldots \mu)$.

In other words the symbolic theory is a complete and self- 
contained discipline; and from the logical point of view it is this 
which gives it a permanent importance. 

The theorem tells us that any such polynomial identity Π
symbolizing such an actual identity in coefficients and variables
of ground forms can be expressed as

$$\Pi = A_1\Pi_1 + A_2\Pi_1' + A_3\Pi_2 + A_4\Pi_2' + A_5\Pi_3 \equiv 0,$$

where, of course, these coefficients $A_i$ need not vanish identically.

3. Binary Quadratic Forms. Reducibility. 

As an illustration of the symbolic principles of reduction, 
we shall consider the problem of finding all possible different 
types of concomitants of a set of binary quadratics. 

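Let $f_1 = a_{11}x_1^2 + 2a_{12}x_1x_2 + a_{22}x_2^2 = a_x^2$,

    $f_2 = b_{11}x_1^2 + 2b_{12}x_1x_2 + b_{22}x_2^2 = b_x^2$,

    $f_3 = c_x^2$, $f_4 = d_x^2$, . . . , so for each letter a, b, c, d, . . . we
have relations

$$a_{ij} = a_{ji} = a_i a_j; \qquad i, j = 1, 2. \qquad (2)$$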

If we consider merely one variable x, together with these
coefficients, every concomitant is expressible, by the fundamental
theorem, as an aggregate of factors (ab), $a_x$, where every letter
a or b is duplicated in each term. For example,

$$V = a_x\,(ab)\,b_x, \qquad W = c_x\,(cd)\,d_x$$

are covariants, and so also is

$$(ab)(cd)\,a_x b_x c_x d_x,$$

which is easily seen to be the same as VW, and is said to be
reducible because it is expressible in terms of simpler covariants.
But V itself is not reducible by such factorizing because no
partition into factors (ab), $a_x$, $b_x$ gives an actual covariant in
non-symbolic form. The covariants V and W are said to be of the
same type because they merely differ by choice of symbols
a, b, c, d of the ground forms; their structure is the same.

Rejecting forms of the same type and obviously reducible
forms, we can immediately write down a list of possible single-
term invariants. They would be

$$(ab)^2 = D_{12},$$
$$(ab)(bc)(ca) = D_{123},$$
$$(ab)(bc)(cd)(da) = D_{1234},$$
$$\ldots$$

with a similar list of covariants

$$a_x(ab)\,b_x = C_{12},$$
$$a_x(ab)(bc)\,c_x = C_{123},$$
$$a_x(ab)(bc)(cd)\,d_x = C_{1234},$$
$$\ldots$$

For these are the only possible structures, involving two a's,
two b's . . . which follow the conditions laid down. Thus if
$C_{ij\ldots}$, $D_{ij\ldots}$ refer to quadratics $f_i, f_j, \ldots$ every concomitant must be
a polynomial in $f_i$, $C_{ij\ldots}$, $D_{ij\ldots}$. We may, however, reject all but
the first two entries in each list as expressible in terms of these
simpler concomitants.

In fact by a fundamental identity since

$$(bc)\,a_x + (ab)\,c_x = (ac)\,b_x,$$

therefore, squaring both sides,

$$(bc)^2a_x^2 + (ab)^2c_x^2 + 2a_x(ab)(bc)\,c_x = (ac)^2b_x^2,$$

or

$$2C_{123} = D_{13}f_2 - D_{12}f_3 - D_{23}f_1.$$

On dividing by 2 this expresses $C_{123}$ in terms of the simpler
$D_{ij}$, $f_i$. Likewise for $C_{ijk}$.

Next by polarizing this identity with regard to y, and
remembering $\left(y\,\dfrac{\partial}{\partial x}\right)a_x = a_y$, we have

$$a_x(ab)(bc)\,c_y + a_y(ab)(bc)\,c_x = D_{13}\,b_xb_y - D_{12}\,c_xc_y - D_{23}\,a_xa_y$$

identically true for all values of y.




In particular, if $y_1 = d_2d_x$, $y_2 = -d_1d_x$, then

$$c_y = (cd)\,d_x, \qquad a_y = (ad)\,d_x,$$

whence

$$a_x(ab)(bc)(cd)\,d_x + d_x(ad)(ab)(bc)\,c_x = D_{13}\,b_x(bd)\,d_x - \text{\&c.}$$

But, by a fundamental identity,

$$a_x(ab)(bc)(cd)\,d_x - d_x(ab)(bc)(ad)\,c_x = d_x(ab)(bc)(ca)\,d_x = D_{123}f_4.$$

Adding these two results,

$$2a_x(ab)(bc)(cd)\,d_x = \cdots + D_{123}f_4,$$

which expresses $C_{1234}$ polynomially in terms of $D_{ij}$, $D_{ijk}$, $C_{ij}$, $f_i$.
Similarly by putting $d_y = (de)\,e_x$, &c., we reduce $C_{12345}$; and
so on for all successive entries in the column of C's.

Further by squaring the identity

$$(bc)(ad) + (ab)(cd) = (ac)(bd)$$

we reduce $D_{1234}$, or what is the same thing by putting $x_1 = d_2$,
$x_2 = -d_1$ in $C_{123}$ we effect this reduction. Each further entry
of the D column is reduced by similar substitution for x. Thus
we have proved the theorem:
we have proved the theorem: 

All concomitants of any number of binary quadratics are
expressible in terms of four types:

$$a_x^2, \qquad (ab)\,a_xb_x, \qquad (ab)^2, \qquad (bc)(ca)(ab).$$

Corollary. Since $(ab)\,a_xb_x = -(ba)\,a_xb_x$, it vanishes identically
when the two quadratics are the same, by interchanging
equivalent symbols. Similarly for (bc) (ca) (ab). Cf. §1, p. 214.

Hence $C_{ii} = 0$, leaving $f_i$, $D_{ii}$, $D_{ij}$, $C_{ij}$, $D_{ijk}$ (where
i, j, k are unequal) as the only possible irreducible concomitants
of the quadratics. All these, except the last, have already been
discussed in Chapter VIII.

We now see that they are completely typical of all possible
concomitants. By no possibility can any of them be expressed
rationally and integrally in terms of the others, as the reader
will see if an attempt is made to do so. Accordingly they are
said to form a complete system, and every polynomial
concomitant is expressible as a polynomial function of members
of the complete system

$$(f_i,\ C_{ij},\ D_{ii},\ D_{ij},\ D_{ijk}).$$

The following table can now be made.

Complete System of Binary Quadratics

Number of         Invariants.                     Covariants.
Ground Forms.     Degree 2.     Degree 3.         Degree 1.    Degree 2.

     1                1             0                 1            0
     2                3             0                 2            1
     n            n(n+1)/2    n(n-1)(n-2)/6           n        n(n-1)/2
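The entries of the table are simple binomial counts: one invariant $D_{ij}$ for each pair i ≤ j, one $D_{ijk}$ for each unordered triple, one covariant $f_i$ for each form, and one $C_{ij}$ for each unordered pair. As a small sketch (the function name and dictionary keys are hypothetical), the general row may be tabulated as follows.

```python
from math import comb

def complete_system_counts(n):
    """Members of the complete system for n binary quadratics, by type."""
    return {
        "invariants_degree_2": comb(n + 1, 2),  # D_ij with i <= j: n(n+1)/2
        "invariants_degree_3": comb(n, 3),      # D_ijk: n(n-1)(n-2)/6
        "covariants_degree_1": n,               # the ground forms f_i
        "covariants_degree_2": comb(n, 2),      # Jacobians C_ij: n(n-1)/2
    }

for n in (1, 2, 3):
    print(n, complete_system_counts(n))
```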


4. Significance of the Complete System. 

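In §9, p. 139, we found the discriminant of the pencil of
quadratics U + λV. Hence the discriminant for $f_1 + \lambda f_2$
will be

$$(a_{11}a_{22} - a_{12}^2) + \lambda\,(a_{11}b_{22} + a_{22}b_{11} - 2a_{12}b_{12}) + \lambda^2(b_{11}b_{22} - b_{12}^2).$$

Symbolically this is

$$\tfrac{1}{2}\{(aa')^2 + 2\lambda\,(ab)^2 + \lambda^2(bb')^2\},$$

where $f_1 = a_x^2 = a_x'^2$, $f_2 = b_x^2 = b_x'^2$. Thus the discriminant
of the pencil $f_1 + \lambda f_2$ is

$$\tfrac{1}{2}(D_{11} + 2\lambda D_{12} + \lambda^2D_{22}).$$

Hence $D_{ii}$ is the discriminant of the quadratic $f_i$, and $D_{ij}$
is the simultaneous invariant, sometimes called the harmonic
invariant of two quadratics $f_i$ and $f_j$.

Next, the other type of invariant can be written as

$$D_{123} = (bc)(ca)(ab) = -\begin{vmatrix} a_1^2 & a_1a_2 & a_2^2 \\ b_1^2 & b_1b_2 & b_2^2 \\ c_1^2 & c_1c_2 & c_2^2 \end{vmatrix},$$

which is easy to verify. Hence in terms of the actual coefficients
it is

$$-\begin{vmatrix} a_{11} & a_{12} & a_{22} \\ b_{11} & b_{12} & b_{22} \\ c_{11} & c_{12} & c_{22} \end{vmatrix}.$$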

This important invariant is called the determinant of the
coefficients of three quadratics. When it vanishes the quadratics
are said to be in involution. But such a determinant vanishes
when and only when the three quadratics are linearly dependent,
§2, p. 75. In fact values $\lambda_1, \lambda_2, \lambda_3$ exist such that

$$\lambda_1f_1 + \lambda_2f_2 + \lambda_3f_3 = 0$$

identically for all values of $x_1, x_2$.

Since a binary quadratic has three coefficients, three quadratics
naturally lead to a square coefficient matrix. If the rank of this
is less than three, the determinant vanishes.
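Both statements about this invariant lend themselves to a direct numerical check. In the sketch below (all names hypothetical) a quadratic is stored as the triple of its coefficients of $x_1^2$, $2x_1x_2$, $x_2^2$; the first check works at the symbolic level, where each symbol pair stands for the perfect square it generates, and the second uses three linearly dependent quadratics with ordinary coefficients.

```python
def det3(r0, r1, r2):
    """Three-rowed determinant with the given rows."""
    return (r0[0] * (r1[1] * r2[2] - r1[2] * r2[1])
            - r0[1] * (r1[0] * r2[2] - r1[2] * r2[0])
            + r0[2] * (r1[0] * r2[1] - r1[1] * r2[0]))

def br(a, b):
    """Binary bracket factor (ab)."""
    return a[0] * b[1] - a[1] * b[0]

def from_symbol(a):
    """Coefficients (a11, a12, a22) = (a1*a1, a1*a2, a2*a2) of the square a_x^2."""
    return (a[0] * a[0], a[0] * a[1], a[1] * a[1])

def d123_symbolic(a, b, c):
    """(bc)(ca)(ab) evaluated on symbol components."""
    return br(b, c) * br(c, a) * br(a, b)

a, b, c = (1, 2), (3, 1), (2, 5)
# Symbol level: (bc)(ca)(ab) is minus the determinant of the coefficient rows.
assert d123_symbolic(a, b, c) == -det3(from_symbol(a), from_symbol(b), from_symbol(c))

# Actual coefficients: f3 = 2*f1 - 3*f2 is linearly dependent on f1, f2.
f1, f2 = (1, 2, 3), (0, 1, -1)
f3 = tuple(2 * p - 3 * q for p, q in zip(f1, f2))
assert det3(f1, f2, f3) == 0
```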

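Next, there is the covariant $C_{12} = (ab)\,a_xb_x$, which is the
Jacobian, already noticed in §12, p. 144. For

$$\begin{vmatrix} \dfrac{\partial a_x^2}{\partial x_1} & \dfrac{\partial b_x^2}{\partial x_1} \\[2mm] \dfrac{\partial a_x^2}{\partial x_2} & \dfrac{\partial b_x^2}{\partial x_2} \end{vmatrix} = 4\begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix} a_xb_x = 4\,(ab)\,a_xb_x.$$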

5. Canonical Form of Two Binary Quadratics. 

Suppose the Jacobian $J = (ab)\,a_xb_x$ has two distinct linear
factors X, Y. Since these are linear in $x_1$, $x_2$ we can take them
as new variables. Let the quadratics $f_1$, $f_2$ now be

$$A_{11}X^2 + 2A_{12}XY + A_{22}Y^2, \qquad B_{11}X^2 + 2B_{12}XY + B_{22}Y^2. \qquad (3)$$

Reference to (43), p. 144, shows that J takes the required form
λXY if, and only if,

$$A_{11}B_{12} - A_{12}B_{11} = 0, \qquad A_{12}B_{22} - A_{22}B_{12} = 0. \qquad (4)$$

For these vanishing expressions are the coefficients of $X^2$ and $Y^2$
in J. This requires either the original quadratics to differ by
a mere constant multiplier so that

$$A_{11} : A_{12} : A_{22} = B_{11} : B_{12} : B_{22}, \qquad (5)$$




or else $A_{12} = B_{12} = 0$. In general the first alternative is not true;
hence two distinct quadratics whose Jacobian has distinct linear
factors can be expressed as the sums of squares

$$A_{11}X^2 + A_{22}Y^2, \qquad B_{11}X^2 + B_{22}Y^2. \qquad (6)$$

Further, if the quadratics have non-vanishing discriminants, none
of $A_{11}$, $A_{22}$, $B_{11}$, $B_{22}$ vanish, otherwise $f_1$ or $f_2$ becomes a perfect
square. We can therefore take

$$X_1 = \sqrt{A_{11}}\,X, \qquad X_2 = \sqrt{A_{22}}\,Y, \qquad (7)$$

as a new linear transformation, finally obtaining

$$f_1 = X_1^2 + X_2^2, \qquad f_2 = \lambda_1X_1^2 + \lambda_2X_2^2, \qquad (8)$$

where $\lambda_1 = B_{11}/A_{11}$, $\lambda_2 = B_{22}/A_{22}$. This is called the canonical
form of two binary quadratics, to which any such ground forms
can be reduced provided that they satisfy the conditions
incidentally utilized above.

If we write M for the modulus of the transformation from the
original variables $x_1$, $x_2$ to $X_1$, $X_2$ then the Jacobian and
discriminants of the canonical forms are (p. 144) multiples of the
original Jacobian and discriminants, namely

$$MJ, \qquad M^2D_{11}, \qquad M^2D_{12}, \qquad M^2D_{22}.$$

Thus by direct calculation we find

$$MJ = (\lambda_2 - \lambda_1)X_1X_2, \quad M^2D_{11} = 2, \quad M^2D_{12} = \lambda_1 + \lambda_2, \quad M^2D_{22} = 2\lambda_1\lambda_2. \qquad (9)$$

Hence the concomitants satisfy an identical relation

$$2J^2 = 2D_{12}f_1f_2 - D_{11}f_2^2 - D_{22}f_1^2, \qquad (10)$$

as is readily verified from the canonical forms. This is the only
relation which exists between these six forms, for in (8) and (9)
the five quantities M, $\lambda_1$, $\lambda_2$, $X_1$, $X_2$ are arbitrary and any further
relation between the concomitants would reduce the number of
these arbitrary parameters to four.
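The relation between J, the three invariants, and the two ground forms, namely that twice the square of J equals $2D_{12}f_1f_2 - D_{11}f_2^2 - D_{22}f_1^2$, can be tested on quadratics that are not in canonical form. The following sketch assumes each quadratic is stored as the triple $(a_{11}, a_{12}, a_{22})$ and uses the non-symbolic expressions $J = (ab)a_xb_x$ and $D_{12} = a_{11}b_{22} + a_{22}b_{11} - 2a_{12}b_{12}$; the function names are hypothetical.

```python
def quad(q, x):
    """Value of a11*x1^2 + 2*a12*x1*x2 + a22*x2^2."""
    a11, a12, a22 = q
    return a11 * x[0] ** 2 + 2 * a12 * x[0] * x[1] + a22 * x[1] ** 2

def jacobian(q1, q2, x):
    """J = (ab) a_x b_x, one quarter of the functional determinant of f1, f2."""
    a11, a12, a22 = q1
    b11, b12, b22 = q2
    return ((a11 * x[0] + a12 * x[1]) * (b12 * x[0] + b22 * x[1])
            - (a12 * x[0] + a22 * x[1]) * (b11 * x[0] + b12 * x[1]))

def harmonic(q1, q2):
    """D_12 = (ab)^2; with q1 == q2 it gives D_11 = 2*(a11*a22 - a12^2)."""
    a11, a12, a22 = q1
    b11, b12, b22 = q2
    return a11 * b22 + a22 * b11 - 2 * a12 * b12

def relation_residual(q1, q2, x):
    """2*J^2 - (2*D12*f1*f2 - D11*f2^2 - D22*f1^2); zero if the relation holds."""
    f1, f2 = quad(q1, x), quad(q2, x)
    j = jacobian(q1, q2, x)
    d11, d12, d22 = harmonic(q1, q1), harmonic(q1, q2), harmonic(q2, q2)
    return 2 * j * j - (2 * d12 * f1 * f2 - d11 * f2 * f2 - d22 * f1 * f1)

assert relation_residual((1, 2, 3), (4, -1, 2), (5, 7)) == 0
assert relation_residual((1, 1, 0), (0, 0, 1), (1, 1)) == 0
```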




EXAMPLES

1. Find the discriminant of J, and its significance.

2. Prove symbolically that the simultaneous invariant of the Jacobian
of two quadratics $f_1$, $f_2$ and another quadratic $f_3$ is . . .

6. Extension to Forms of Higher Order.

Before giving a geometrical interpretation of these results,
it is worth noting how they may be extended. Since the symbols
$a_1, a_2, b_1, b_2, \ldots$ behave like ordinary numbers, any identity,
or relation, established for quadratics necessarily gives
information about cubics and higher forms.

EXAMPLES 

1. Just as $(ab)\,a_xb_x$ is the Jacobian of two quadratics,
$(ab)\,a_x^{m-1}b_x^{n-1}$
is, to a constant numerical factor mn, the Jacobian of an m-ic $a_x^m$ and
an n-ic $b_x^n$.

2. Since $2(ab)(ac)\,b_xc_x = (ab)^2c_x^2 + (ac)^2b_x^2 - (bc)^2a_x^2$ (§3, p. 216), it
follows that

$$2(ab)(ac)\,a_x^{m-2}b_x^{n-1}c_x^{p-1} = (ab)^2a_x^{m-2}b_x^{n-2}c_x^{p} + (ac)^2a_x^{m-2}b_x^{n}c_x^{p-2} - (bc)^2a_x^{m}b_x^{n-2}c_x^{p-2}$$

when m > 1, n > 1, p > 1.

In such identities the significant parts of each term are the bracket
factors, for, if these are known, the x-factors $a_x$, $b_x$, $c_x$ follow automatically
to make up the requisite number of symbols for each term.

7. Transvectants. 

Definition of Transvectant. The covariant $(ab)^r a_x^{m-r}b_x^{n-r}$
of binary quantics $f = a_x^m$, $\phi = b_x^n$ is their rth transvectant, and
is often written $(f, \phi)^r$.

If r exceeds the lesser of m and n, the transvectant is zero:
if r = m = n, it is an invariant: and if r = 1 it is, to a constant
factor, their Jacobian, written $(f, \phi)$ with the index omitted.

We may easily prove that all odd transvectants of a form
f with itself vanish identically, and all even transvectants give
its covariants of degree two. For if

$$f = a_x^m = b_x^m,$$

then

$$(f, f)^r = (ab)^r a_x^{m-r}b_x^{m-r} = (-)^r(ba)^r a_x^{m-r}b_x^{m-r},$$

which leads to a zero result after interchanging equivalent
symbols, if r is odd. Also by the fundamental theorem the only
covariants of degree two are polynomials in (ab), $a_x$, $b_x$: hence
they are transvectants.
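A transvectant can also be computed from actual coefficients. The sketch below rests on an assumption not stated in the text: that $(ab)^r a_x^{m-r}b_x^{n-r}$ equals $\frac{(m-r)!\,(n-r)!}{m!\,n!}\sum_i (-1)^i\binom{r}{i}\,\frac{\partial^r f}{\partial x_1^{r-i}\partial x_2^{i}}\,\frac{\partial^r \phi}{\partial x_1^{i}\partial x_2^{r-i}}$, which follows on differentiating the symbolic powers and collecting $(a_1b_2 - a_2b_1)^r$. A form is stored as a dictionary from exponent pairs (i, j) to the coefficient of $x_1^i x_2^j$; all function names are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def diff(poly, var):
    """Differentiate {(i, j): c} with respect to x1 (var=0) or x2 (var=1)."""
    out = {}
    for (i, j), c in poly.items():
        e = (i, j)[var]
        if e > 0:
            key = (i - 1, j) if var == 0 else (i, j - 1)
            out[key] = out.get(key, 0) + c * e
    return out

def diff_many(poly, k1, k2):
    """Apply d/dx1 k1 times and d/dx2 k2 times."""
    for _ in range(k1):
        poly = diff(poly, 0)
    for _ in range(k2):
        poly = diff(poly, 1)
    return poly

def mul(p, q):
    """Product of two polynomials in x1, x2."""
    out = {}
    for (i, j), c in p.items():
        for (k, l), d in q.items():
            out[(i + k, j + l)] = out.get((i + k, j + l), 0) + c * d
    return out

def transvectant(f, m, g, n, r):
    """rth transvectant of an m-ic f and an n-ic g (assumed normalization)."""
    if r > min(m, n):
        return {}
    scale = Fraction(factorial(m - r) * factorial(n - r),
                     factorial(m) * factorial(n))
    total = {}
    for i in range(r + 1):
        term = mul(diff_many(f, r - i, i), diff_many(g, i, r - i))
        sign = (-1) ** i * comb(r, i)
        for key, c in term.items():
            total[key] = total.get(key, 0) + scale * sign * c
    return {k: c for k, c in total.items() if c != 0}

# Quadratic x1^2 + 4*x1*x2 + 3*x2^2, i.e. (a0, a1, a2) = (1, 2, 3).
quadratic = {(2, 0): 1, (1, 1): 4, (0, 2): 3}
assert transvectant(quadratic, 2, quadratic, 2, 1) == {}          # odd: vanishes
assert transvectant(quadratic, 2, quadratic, 2, 2) == {(0, 0): -2}  # 2(a0*a2 - a1^2)
```

Exact rational arithmetic is used so that the vanishing of the odd transvectant is not obscured by rounding.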

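Another important case is the Hessian of a binary m-ic, namely,

$$\begin{vmatrix} \dfrac{\partial^2 f}{\partial x_1^2} & \dfrac{\partial^2 f}{\partial x_1\partial x_2} \\[2mm] \dfrac{\partial^2 f}{\partial x_1\partial x_2} & \dfrac{\partial^2 f}{\partial x_2^2} \end{vmatrix} = m^2(m-1)^2 \begin{vmatrix} a_1^2 & a_1a_2 \\ b_1b_2 & b_2^2 \end{vmatrix} a_x^{m-2}b_x^{m-2}$$

$$= m^2(m-1)^2 \begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix} a_1b_2\,a_x^{m-2}b_x^{m-2}$$

$$= \tfrac{1}{2}m^2(m-1)^2\,(ab)^2 a_x^{m-2}b_x^{m-2}$$

$$= \tfrac{1}{2}m^2(m-1)^2\,(f, f)^2 = \tfrac{1}{2}m^2(m-1)^2 H.$$

Here we have started with the Jacobian of the two first polars
$\partial f/\partial x_1$, $\partial f/\partial x_2$ of a binary m-ic, and have thrown it into symbolic
form, thereby requiring two equivalent symbols because the
original determinant was of degree two in the coefficients of f.
The result $(f, f)^2$ shows that the Hessian is a covariant, a
feature which can easily be generalized for the Hessian of n
variables.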

EXAMPLES 

1. The Hessian does not exist for a linear form. 

2. The Hessian of a quadratic is its discriminant. 

3. The Hessian of a binary cubic is a quadratic. 

4. The Hessian of a binary quartic is a quartic. 

5. The coefficient of the highest power of $x_1$ in the Hessian of
$(a_0, \ldots, a_n \between x_1, x_2)^n$ is $a_0a_2 - a_1^2$, to a constant numerical factor.

6. Write in full the Hessian $\left|\dfrac{\partial^2 f}{\partial x_i\,\partial x_j}\right|$ of a ternary form $f = a_x^m = b_x^m = c_x^m$
and show that neglecting a numerical factor its symbolic form is
$(abc)^2 a_x^{m-2}b_x^{m-2}c_x^{m-2}$.

7. The bracket factors of the Hessian of an n-ary form $a_x^p = b_x^p$
= &c., are $(abc \ldots m)^2$. The Hessian is of weight two and degree n.




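8. Reducibility of Jacobians.

We can now prove two theorems concerning binary Jacobians.

Theorem I. The Jacobian of two quantics, one of which is a
Jacobian, is reducible.

This is often quoted as: the Jacobian of a Jacobian is reducible,
and it is true of forms of general order n.¹

Proof.

Let $(f, \phi)$ denote the Jacobian $(ab)\,a_x^{m-1}b_x^{n-1}$ of $f = a_x^m$
and $\phi = b_x^n$. Then the Jacobian J of $(f, \phi)$ and $\psi = c_x^p$ is given by

$$(m+n-2)\,p\,J = \begin{vmatrix} \dfrac{\partial (f,\phi)}{\partial x_1} & \dfrac{\partial \psi}{\partial x_1} \\[2mm] \dfrac{\partial (f,\phi)}{\partial x_2} & \dfrac{\partial \psi}{\partial x_2} \end{vmatrix}$$

$$= (ab)\begin{vmatrix} (m-1)\,a_x^{m-2}b_x^{n-1}a_1 + (n-1)\,a_x^{m-1}b_x^{n-2}b_1 & p\,c_x^{p-1}c_1 \\ (m-1)\,a_x^{m-2}b_x^{n-1}a_2 + (n-1)\,a_x^{m-1}b_x^{n-2}b_2 & p\,c_x^{p-1}c_2 \end{vmatrix}.$$

Breaking this up by columns into two determinants it gives

$$p\,\{(m-1)(ab)(ac)\,a_x^{m-2}b_x^{n-1}c_x^{p-1} + (n-1)(ab)(bc)\,a_x^{m-1}b_x^{n-2}c_x^{p-1}\}.$$

But $2(ab)(ac)\,b_xc_x = (ab)^2c_x^2 + (ac)^2b_x^2 - (bc)^2a_x^2$

and $2(ab)(bc)\,a_xc_x = -(ab)^2c_x^2 - (bc)^2a_x^2 + (ac)^2b_x^2$.

Therefore

$$2J = \frac{m-n}{m+n-2}\,(ab)^2a_x^{m-2}b_x^{n-2}c_x^{p} + (ac)^2a_x^{m-2}b_x^{n}c_x^{p-2} - (bc)^2a_x^{m}b_x^{n-2}c_x^{p-2}$$

$$= \frac{m-n}{m+n-2}\,(f, \phi)^2\,\psi + (f, \psi)^2\,\phi - (\phi, \psi)^2 f,$$

which proves the theorem.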

Theorem II. The product of two binary Jacobians is reducible,
for all cases excluding linear forms.

¹ Cf. Gilham, Proc. London Math. Soc., 2, 20 (1921-2), 326-328.



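Proof.

By use of the identity

$$(cd)\,a_x = (ad)\,c_x + (ca)\,d_x$$

together with the preceding identities, we have

$$2(ab)(cd)\,a_xb_xc_xd_x = 2(ab)\{(ad)\,c_x + (ca)\,d_x\}\,b_xc_xd_x$$

$$= (ab)^2c_x^2d_x^2 + (ad)^2b_x^2c_x^2 - (bd)^2a_x^2c_x^2$$

$$\quad - (ab)^2c_x^2d_x^2 - (ac)^2b_x^2d_x^2 + (bc)^2a_x^2d_x^2,$$

giving four terms after cancelling two. Now we multiply throughout
by

$$a_x^{m-2}b_x^{n-2}c_x^{p-2}d_x^{q-2}, \qquad (m, n, p, q > 1),$$

and interpret the result with regard to four ground forms

$$f_1 = a_x^m, \qquad f_2 = b_x^n, \qquad f_3 = c_x^p, \qquad f_4 = d_x^q.$$

Hence the Jacobian of $f_1$ and $f_2$, multiplied by that of $f_3$ and $f_4$
is equal to

$$\tfrac{1}{2}\{(f_1, f_4)^2 f_2f_3 - (f_2, f_4)^2 f_1f_3 - (f_1, f_3)^2 f_2f_4 + (f_2, f_3)^2 f_1f_4\},$$

where $(f_1, f_4)^2 = (ad)^2a_x^{m-2}d_x^{q-2}$, &c.; and this effects the reduction.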

In these two theorems we have expressed a more complicated
covariant in terms of covariants of lower degree,
thereby gaining results which have wide applications. The
most useful case of the latter result is when $f_1 = f_3 = f = a_x^m$,
and $f_2 = f_4 = H = (ab)^2a_x^{m-2}b_x^{m-2}$, the Hessian of f. In this
case the two Jacobians in question become

$$(f_1, f_2) = (f_3, f_4) = (f, H),$$

the important covariant of degree three, usually denoted by the
letter t. The theorem then tells us that the square of this
covariant is reducible. On this result, or syzygy as it is called,
the whole theory of solving binary cubics and quartics depends,
and also the remarkable theory of finite groups of rotations
whereby the five regular Platonic solids may be brought into
coincidence with themselves, a subject very beautifully discussed
by F. Klein.¹

¹ Lectures on the Icosahedron, translated by G. G. Morrice (London, 1913).





9. Remarks on the Proof of the Second Fundamental Theorem. 

The existing proofs for the general case of this theorem are long and
difficult, but can be much simplified with the help of a recent discovery¹
by R. Weitzenböck concerning all possible relations between the
compound co-ordinates $\pi_r = (xy \ldots z)_r$, where 2 ≤ r ≤ n − 2. For a given
value of r every polynomial relation $\Pi(\pi_r)$ which vanishes
identically when $\pi_r$ is resolved into elements $x_i, \ldots, z_k$, can be expressed as a
finite series $\sum A_i\Pi_i$, where each $\Pi_i$ denotes a quadratic relation in $\pi_r$. Now
all such p-relations as they are called fall under the type $\Pi_2$ already quoted,
a well-known instance of which is

$$p_{12}p_{34} + p_{23}p_{14} + p_{31}p_{24} = 0$$

between line co-ordinates $(xy)_{ij} = p_{ij}$, when n = 4.
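The quadratic relation between the six line co-ordinates of the join of two points in the quaternary field is easily tested. A minimal sketch follows, assuming the co-ordinates are the two-rowed minors $p_{ij} = x_iy_j - x_jy_i$ with suffixes counted from 0 in the code; the function names are hypothetical.

```python
def line_coordinates(x, y):
    """p[i, j] = x_i*y_j - x_j*y_i for the line joining points x and y."""
    return {(i, j): x[i] * y[j] - x[j] * y[i]
            for i in range(4) for j in range(4)}

def p_relation(x, y):
    """p12*p34 + p23*p14 + p31*p24, written with suffixes shifted to start at 0."""
    p = line_coordinates(x, y)
    return p[0, 1] * p[2, 3] + p[1, 2] * p[0, 3] + p[2, 0] * p[1, 3]

assert p_relation((1, 2, 3, 4), (0, -1, 5, 2)) == 0
```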

The proof is very like that of the First Fundamental Theorem,
requiring also Bazin's theorem (Ex. 9, p. 56) and the conception of implicit
convolution (herausgegriffenen Reihen) for its completion.

These p-relations have hitherto been known to be sufficient to express
all relations between each set of compound co-ordinates, because they
furnish a particular case of the Second Fundamental Theorem. But this
direct proof, recently found, provides a more direct approach to this
theorem.

Another useful method is developed by B. L. van der Waerden,
Math. Annalen, 95 (1926), 706-736.

¹ Weitzenböck, Math. Annalen, 97 (1927), 788-796; 99 (1928), 493-496.





CHAPTER XIV 

Seminvariants. Algebraically Complete Systems 

1. Seminvariants and Leading Term of a Concomitant. 

Symbolic methods lead quickly to the important result that
a binary covariant is completely specified if its leading term is
known, by which is meant the term with the highest power of
$x_1$. By the fundamental theorem the symbolic product

$$C = (ab)^p(ac)^q \cdots a_x^r b_x^s \cdots$$

is a covariant, provided it contains the requisite number of
symbols a, b, &c., imposed by the ground forms. If this is of
order ϖ in x we may adopt a new symbol and write it as a single
ϖth power, the symbolic equality being identically true for all
values of $x_1$, $x_2$. Let w = p + q + . . . be
the total index of the bracket factors.

Now consider the polynomial S defined by taking $x_1 = 1$,
$x_2 = 0$, in C:

$$S = (ab)^p(ac)^q \cdots a_1^r b_1^s \cdots,$$

since in this case $a_x = a_1$, $b_x = b_1$, &c. Then S is a polynomial
in the coefficients, and in fact is the leading term of the covariant
C. When x → x', let a → a'. Then (p. 184 (8)) since $(a'b') = (\xi\eta)(ab)$,
$a_1' = a_\xi$, we have

$$S' = (a'b')^p(a'c')^q \cdots a_1'^{\,r}\,b_1'^{\,s} \cdots = (\xi\eta)^w\,(ab)^p(ac)^q \cdots a_\xi^r b_\xi^s \cdots$$

If we divide S' by $(\xi\eta)^w$ we obtain the original covariant with
$\xi_1$, $\xi_2$ replacing $x_1$, $x_2$. So from the leading coefficient of a covariant
we deduce the whole covariant by making the linear transformation
on the coefficients and dividing by a suitable power, w, of the
modulus. This process is non-symbolical.




For example, $(\alpha\beta)^2\alpha_x\beta_x$ is a covariant of the cubic

$$A_0x_1^3 + 3A_1x_1^2x_2 + 3A_2x_1x_2^2 + A_3x_2^3 = \alpha_x^3 = \beta_x^3.$$

Its leading term is

$$(\alpha\beta)^2\alpha_1\beta_1\,x_1^2 = (\alpha_1^2\beta_2^2 - 2\alpha_1\alpha_2\beta_1\beta_2 + \alpha_2^2\beta_1^2)\,\alpha_1\beta_1\,x_1^2 = 2(A_0A_2 - A_1^2)\,x_1^2.$$

The leading coefficient is called a seminvariant. Symbolically
we may define a seminvariant to be a polynomial in the types
(ab), $a_1$, where the suffix is always unity. Non-symbolically the
seminvariant is usually defined to be an invariant of the restricted
linear transformation

$$x_1 = x_1' + \eta_1x_2', \qquad x_2 = \eta_2x_2',$$

where it will be noted that the coefficients $\xi_1$, $\xi_2$ are 1, 0.
Manifestly the symbol $a_1$ is now invariantive because

$$a_1' = a_\xi = a_1,$$

whereas the symbol $a_2$ is not. It will further be noted that this
restricted transformation belongs to what has been called an
affine group (§7, p. 162).


2. Seminvariants as Solutions of Partial Differential Equations. 


The seminvariant provides a useful means of studying the 
concomitants of ground forms, without recourse to the symbolic 
methods. The leading idea which governs this alternative theory 
lies in the solution of a differential equation 


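\[ a_0\frac{\partial S}{\partial a_1} + 2a_1\frac{\partial S}{\partial a_2} + 3a_2\frac{\partial S}{\partial a_3} + \ldots + pa_{p-1}\frac{\partial S}{\partial a_p} = 0, \tag{1} \]

where $S$ is regarded as a polynomial in $p + 1$ independent
variables $a_0, a_1, \ldots, a_p$. Sylvester and Cayley first studied this
equation, basing their results on the fact that if these independent
variables are coefficients of a binary $p$-ic $(a_0, a_1, \ldots, a_p \between x_1, x_2)^p$,
then every polynomial solution of the differential equation is
a seminvariant of the $p$-ic. The seminvariant $S$ has, in the
words of Sylvester, an annihilator $\Omega$, namely
\[ \Omega = a_0\frac{\partial}{\partial a_1} + 2a_1\frac{\partial}{\partial a_2} + \ldots + pa_{p-1}\frac{\partial}{\partial a_p}. \tag{2} \]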

Further, every polynomial $I$ annihilated both by $\Omega$ and the
corresponding operator $O$, given by
\[ O = a_p\frac{\partial}{\partial a_{p-1}} + 2a_{p-1}\frac{\partial}{\partial a_{p-2}} + \ldots + pa_1\frac{\partial}{\partial a_0}, \tag{3} \]
is an invariant of the $p$-ic.

Granted the first result, the second follows in various ways.
Thus it will be seen that $O$ is derived from $\Omega$ by reversing the
terms of the $p$-ic. Consequently if $\Omega S = 0$ implies $S$ is a leading
term of a covariant, then $OT = 0$ implies $T$ is the final term of a
covariant. Symbolically $S$ being composed of types $(ab)$, $a_1$,
then $T$ would involve $(ab)$ and $a_2$. Accordingly they can only
be the same expression if $a_1$, $a_2$ are absent and type $(ab)$ is left,
giving an invariant. Or, again, the leading term contains no $x_2$,
so the final term contains no $x_1$: hence the covariant is free from
both variables, and is therefore an invariant.


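We therefore consider the annihilator $\Omega$. To this end let
the non-homogeneous form be taken
\[ f(x) = U_p = a_0x^p + pa_1x^{p-1} + \ldots + a_p = (a_0, a_1, \ldots, a_p \between x, 1)^p, \tag{4} \]
so that
\[ \frac{df}{dx} = pU_{p-1} = p\,(a_0, a_1, \ldots, a_{p-1} \between x, 1)^{p-1}, \]
\[ \frac{d^2f}{dx^2} = p(p-1)U_{p-2} = p(p-1)\,(a_0, a_1, \ldots, a_{p-2} \between x, 1)^{p-2}, \]
and so on. Then if $x = y + h$,
\[ f(y + h) = f(h) + yf'(h) + \ldots + \frac{y^p}{p!}f^{(p)}(h) = a_0y^p + pU_1(h)y^{p-1} + \binom{p}{2}U_2(h)y^{p-2} + \ldots + U_p(h), \]
where
\[ U_1(h) = a_0h + a_1, \quad U_2(h) = a_0h^2 + 2a_1h + a_2, \quad \text{\&c.} \tag{5} \]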

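Now if $\alpha, \beta, \ldots, \omega$ are the roots of $f(x) = 0$, then any polynomial
$F(a_0, a_1, \ldots, a_p)$ is a symmetric polynomial function
$\phi(\alpha, \beta, \ldots, \omega)$ of these roots. Also since $x = y + h$, the roots
of the corresponding equation for $y$ are $\alpha - h, \beta - h, \ldots, \omega - h$.
So we are led to the identities
\[ \begin{aligned} \phi(\alpha, \beta, \ldots, \omega) &= F(a_0, a_1, \ldots, a_p), \\ \phi(\alpha - h, \beta - h, \ldots, \omega - h) &= F(a_0, U_1(h), \ldots, U_p(h)) \\ &= F(a_0, a_0h + a_1, a_0h^2 + 2a_1h + a_2, \ldots). \end{aligned} \tag{6} \]

Expanding both sides as ascending power series in $h$ and retaining
only the first power of $h$, we have by Taylor's theorem for $p$
variables,
\[ \phi - h\left(\frac{\partial\phi}{\partial\alpha} + \frac{\partial\phi}{\partial\beta} + \ldots + \frac{\partial\phi}{\partial\omega}\right) = F(a_0, \ldots, a_p) + h\left(a_0\frac{\partial F}{\partial a_1} + 2a_1\frac{\partial F}{\partial a_2} + \ldots + pa_{p-1}\frac{\partial F}{\partial a_p}\right). \]
Thus the two operators are equivalent:
\[ a_0\frac{\partial}{\partial a_1} + 2a_1\frac{\partial}{\partial a_2} + \ldots + pa_{p-1}\frac{\partial}{\partial a_p} \equiv -\left(\frac{\partial}{\partial\alpha} + \frac{\partial}{\partial\beta} + \ldots + \frac{\partial}{\partial\omega}\right), \tag{7} \]
the first $\Omega$ taking effect on a function $F$ explicitly containing
$a_0, a_1, \ldots, a_p$, the second on the same function expressed in terms of
the roots $\alpha, \beta, \ldots, \omega$. Hence any solution of $\Omega F = 0$, when
expressed in terms of the roots, is a solution of
\[ \frac{\partial\phi}{\partial\alpha} + \frac{\partial\phi}{\partial\beta} + \ldots + \frac{\partial\phi}{\partial\omega} = 0. \tag{8} \]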

But the general solution of this last differential equation is found
from Lagrange's auxiliary equations
\[ \frac{d\alpha}{1} = \frac{d\beta}{1} = \ldots = \frac{d\omega}{1} = \frac{d\phi}{0}. \]
Independent integrals of this system are $\phi$ = constant, and $p - 1$
differences of the roots. So any solution of the equation (8) is a
function of the differences of the roots, say
\[ \phi = S(\alpha - \beta, \alpha - \gamma, \ldots). \tag{9} \]
Again, in the alternative form $\Omega F = 0$, Lagrange's auxiliary
equations are
\[ \frac{da_0}{0} = \frac{da_1}{a_0} = \frac{da_2}{2a_1} = \ldots = \frac{da_p}{pa_{p-1}} = \frac{dF}{0}. \]
Independent integrals of these are $F$ = constant, and, including
$a_0$, $p$ solutions involving the $a$'s. But such solutions are given
by the system
\[ \begin{aligned} S_2 &= a_0a_2 - a_1^2, \\ S_3 &= a_0^2a_3 - 3a_0a_1a_2 + 2a_1^3, \\ S_4 &= a_0^3a_4 - 4a_0^2a_1a_3 + 6a_0a_1^2a_2 - 3a_1^4, \\ &\;\;\vdots \\ S_p &= a_0^{p-1}a_p - \binom{p}{1}a_0^{p-2}a_1a_{p-1} + \binom{p}{2}a_0^{p-3}a_1^2a_{p-2} - \ldots + (-1)^{p-1}(p - 1)a_1^p. \end{aligned} \tag{10} \]

For these polynomials are the numerators of the rational functions
$U_0(h), U_1(h), \ldots, U_p(h)$ when $h = -a_1/a_0$, as is seen by
substitution in (5). Also each $U_i(h)$ is a symmetric function of the
roots of $f(y + h) = 0$ as an equation in $y$, and further, in this
particular case each root for $y$ is a function of the differences of
the roots $\alpha, \beta, \ldots, \omega$ for $x$, since
\[ \alpha - h = \alpha + \frac{a_1}{a_0} = \alpha - \frac{\alpha + \beta + \ldots + \omega}{p} = \frac{\alpha - \beta}{p} + \ldots + \frac{\alpha - \omega}{p}. \]
Hence $U_i(-a_1/a_0)$ is a particular function of type $S$. Finally since
the $p$ functions $S_i$ are independent (including $S_0 = a_0$, which is not found
among the other $S_i$), the general solution of the equation $\Omega F = 0$ is established;
namely $F$ is an explicit function of the $p$ polynomials $S_i$,
\[ F(S_0, S_2, S_3, \ldots, S_p). \]
Thus a polynomial in these $S_i$ is a polynomial in the differences
of roots of $f(x) = 0$.
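The two properties just used, that each $S_i$ of (10) is annihilated by $\Omega$ and is unchanged when every $a_k$ is replaced by $U_k(h)$ of (5), can be checked mechanically. The following Python sketch is offered only as an illustration under stated assumptions: polynomials in $a_0, \ldots, a_4$ are stored as dictionaries from exponent tuples to integer coefficients, and the helper names `omega`, `evaluate` and `translate` are introduced here for the purpose and do not occur in the text.

```python
from math import comb

def omega(F):
    """Apply a0 d/da1 + 2 a1 d/da2 + ... + p a_{p-1} d/da_p to F."""
    out = {}
    for e, c in F.items():
        for k in range(1, len(e)):
            if e[k]:
                d = list(e)
                d[k] -= 1          # differentiate with respect to a_k
                d[k - 1] += 1      # multiply by a_{k-1}
                key = tuple(d)
                out[key] = out.get(key, 0) + k * e[k] * c
    return {e: c for e, c in out.items() if c != 0}

def evaluate(F, values):
    """Numerical value of F when a_k = values[k]."""
    total = 0
    for e, c in F.items():
        term = c
        for v, n in zip(values, e):
            term *= v ** n
        total += term
    return total

def translate(values, h):
    """U_k(h) = sum_j C(k, j) a_j h^(k-j), the coefficients of f(y + h)."""
    return [sum(comb(k, j) * values[j] * h ** (k - j) for j in range(k + 1))
            for k in range(len(values))]

# S2, S3, S4 of the system (10); exponent tuples are (a0, a1, a2, a3, a4).
S2 = {(1, 0, 1, 0, 0): 1, (0, 2, 0, 0, 0): -1}
S3 = {(2, 0, 0, 1, 0): 1, (1, 1, 1, 0, 0): -3, (0, 3, 0, 0, 0): 2}
S4 = {(3, 0, 0, 0, 1): 1, (2, 1, 0, 1, 0): -4,
      (1, 2, 1, 0, 0): 6, (0, 4, 0, 0, 0): -3}

coeffs = [2, -1, 3, 5, -4]          # arbitrary sample values of a0..a4
for S in (S2, S3, S4):
    annihilated = omega(S) == {}
    unchanged = all(evaluate(S, translate(coeffs, h)) == evaluate(S, coeffs)
                    for h in range(-3, 4))
    print(annihilated, unchanged)
```

The numerical test of invariance is of course no proof; it merely samples the identity $F(a_0, U_1(h), \ldots) = F(a_0, a_1, \ldots)$ at a few integer values of $h$.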

Finally, such a function satisfies the seminvariant condition;
since in the present case $x = x_1 : x_2$, so that $x \to x'$ becomes
\[ x = \frac{x' + \eta_1}{\eta_2}, \quad \text{i.e.} \quad \eta_2x = x' + \eta_1, \]
whence $\eta_2\alpha = \alpha' + \eta_1$, connecting any root $\alpha$ with $\alpha'$, the
corresponding root of the $p$-ic in $x'$. Thus $\eta_2(\alpha - \beta) = \alpha' - \beta'$.

Accordingly if $\phi$ is a homogeneous function of degree $w$ in
the differences of the roots $\alpha - \beta$, the corresponding transformed
function of $\alpha' - \beta'$ is $\phi$ multiplied by $\eta_2^w$, depending only
on the coefficients of the transformation. Thus $\phi$ is a seminvariant
and so also is its alternative form $F(S_0, \ldots, S_p)$.

It may be noticed that the degree and weight of $S_i$ are equal
to $i$. If $S_i$ is the leading coefficient in a covariant of order $m$,
then (§4, p. 186) the valency condition $2w + m = pq$ gives
\[ 2i + m = ip \quad \text{or} \quad m = i(p - 2). \]

EXAMPLES

1. For a linear form $S_i = 0$ ($i > 1$), $S_0$ alone exists. For a quadratic
$S_0$ is leader of the ground form and $S_2$ is the invariant discriminant.

2. Verify that $\Omega S_2 = OS_2 = 0$ for the quadratic.

3. Verify that $OS_i \neq 0$, $p > i > 2$.

4. Prove that $S_2$ is the leader of the Hessian covariant of a $p$-ic, namely,
\[ \begin{vmatrix} \dfrac{\partial^2 f}{\partial x_1^2} & \dfrac{\partial^2 f}{\partial x_1\partial x_2} \\ \dfrac{\partial^2 f}{\partial x_1\partial x_2} & \dfrac{\partial^2 f}{\partial x_2^2} \end{vmatrix}. \]

5. Show that $S_3$ is the leader of the Jacobian of a $p$-ic $f$ and its
Hessian $H$. We write this covariant $(f, H)$.

6. Express $S_2$, $S_3$ in symbolic form for $f = a_x^p = b_x^p = c_x^p$.

7. What covariants are symbolized by
\[ (ab)^r a_x^{p-r} b_x^{p-r}\,? \]

8. Write down the most general polynomial of degree 3 and weight 6
in the coefficients $a_0, \ldots, a_4$ of a binary quartic. Show that if $\Omega$ or
if $O$ annihilates it, then it is the $J$ invariant.

Ans. $\lambda a_0a_2a_4 + \mu a_0a_3^2 + \nu a_1^2a_4 + \rho a_1a_2a_3 + \sigma a_2^3$;
\[ J = \begin{vmatrix} a_0 & a_1 & a_2 \\ a_1 & a_2 & a_3 \\ a_2 & a_3 & a_4 \end{vmatrix}. \]

9. A homogeneous, isobaric polynomial in $a_0, a_1, \ldots, a_p$ is a gradient.
Prove for any gradient $G$ of degree $q$ and weight $w$
\[ (\Omega O - O\Omega)G = (pq - 2w)G. \]

10. What further condition is necessary for $G$ to be an invariant?
[$pq = 2w$.]
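Examples 8 and 9 lend themselves to a mechanical check. The Python sketch below is illustrative only and rests on assumptions made here: polynomials in $a_0, \ldots, a_4$ (so $p = 4$) are dictionaries from exponent tuples to integer coefficients, and the names `omega`, `big_o`, `commutator` and `degree_and_weight` are hypothetical helpers, not notation of the text. It applies $\Omega O - O\Omega$ to one sample gradient and applies $\Omega$ and $O$ to the two quartic invariants.

```python
def omega(F):
    """Apply a0 d/da1 + 2 a1 d/da2 + ... + p a_{p-1} d/da_p to F."""
    out = {}
    for e, c in F.items():
        for k in range(1, len(e)):
            if e[k]:
                d = list(e)
                d[k] -= 1
                d[k - 1] += 1
                key = tuple(d)
                out[key] = out.get(key, 0) + k * e[k] * c
    return {e: c for e, c in out.items() if c != 0}

def big_o(F):
    """Apply a_p d/da_{p-1} + 2 a_{p-1} d/da_{p-2} + ... + p a_1 d/da_0 to F."""
    out = {}
    for e, c in F.items():
        p = len(e) - 1
        for k in range(p):
            if e[k]:
                d = list(e)
                d[k] -= 1
                d[k + 1] += 1
                key = tuple(d)
                out[key] = out.get(key, 0) + (p - k) * e[k] * c
    return {e: c for e, c in out.items() if c != 0}

def commutator(G):
    """(Omega O - O Omega) applied to G."""
    out = dict(omega(big_o(G)))
    for e, c in big_o(omega(G)).items():
        out[e] = out.get(e, 0) - c
    return {e: c for e, c in out.items() if c != 0}

def degree_and_weight(G):
    """Degree q and weight w of a gradient, read off from any one term."""
    e = next(iter(G))
    return sum(e), sum(k * n for k, n in enumerate(e))

p = 4
# A sample gradient of degree 2 and weight 3: a0*a3 + 5*a1*a2.
G = {(1, 0, 0, 1, 0): 1, (0, 1, 1, 0, 0): 5}
q, w = degree_and_weight(G)
expected = {e: (p * q - 2 * w) * c for e, c in G.items()}
print(commutator(G) == expected)

# a0*a4 - 4*a1*a3 + 3*a2^2 and the determinant J of Example 8.
I2 = {(1, 0, 0, 0, 1): 1, (0, 1, 0, 1, 0): -4, (0, 0, 2, 0, 0): 3}
J = {(1, 0, 1, 0, 1): 1, (0, 1, 1, 1, 0): 2, (0, 0, 3, 0, 0): -1,
     (1, 0, 0, 2, 0): -1, (0, 2, 0, 0, 1): -1}
print([omega(I2), big_o(I2), omega(J), big_o(J)])
```

For both invariants $pq = 2w$, in agreement with Example 10, so that the commutator identity imposes no further restriction on them.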

3. Algebraically Complete Systems. Syzygies. 

With the help of the theory of linear partial differential
equations we have found a set of $p$ seminvariants in terms of
which the most general seminvariant of a binary $p$-ic can be
expressed. Such a set is called algebraically complete because it
can be proved that the relations connecting other seminvariants
with these $p$ forms $S_i$ are expressible in a rational polynomial
form. They are called syzygies.

If $S', S'', \ldots$ are seminvariants connected by a syzygy
$\phi = 0$, we suppose $\phi$ to be isobaric in the original coefficients,
otherwise it breaks up into isobaric terms each of which is itself
a syzygy, exactly as was the case in §2, p. 171.

Since a covariant is determined by its leading term, it follows
that any such syzygy between seminvariants leads to an exact
replica between covariants (and invariants). The symbolic proof
is instantaneous. For if the syzygy is written

and Wp is the weight of then the sum 

must be the same for every term. Hence if we put the syzygy
into symbols and then change $a_1$ into $a_x$ and $b_1$ into $b_x$, &c.,
we have

+ . * . . . . == 0 , 

or = 

i.e. the covariants C', C" . . . are connected by the same relation 
as the seminvariants. 

It follows that the theory of algebraically complete systems 
of concomitants can at once be derived from such a theory for 
seminvariants. 

This theory of algebraically complete systems and of binary
annihilators $\Omega$, $O$, for a single ground form, can be extended.

For example,¹ if several ground forms are concerned, each has
its own operators $\Omega$ and $O$. It can be proved that a simultaneous
invariant is annihilated by the sums of these operators, namely
\[ \Sigma\Omega I = 0, \qquad \Sigma OI = 0. \]

A special case of this is the covariant $C$ of one ground form, which
may be regarded as an invariant of the system of this ground
form with the linear form whose coefficients are $-x_2$, $x_1$.

¹ Elliott, Algebra of Quantics (Oxford, 1913), 120-124.

The $\Omega$ of this latter is simply $-x_2\,\partial/\partial x_1$ and its $O$ is $-x_1\,\partial/\partial x_2$.
Accordingly
\[ \left(\Omega - x_2\frac{\partial}{\partial x_1}\right)C = 0, \qquad \left(O - x_1\frac{\partial}{\partial x_2}\right)C = 0. \]
Thus a covariant is a polynomial, satisfying the valency conditions,
which is annihilated by these two operators.

EXAMPLES

1. Test the Hessian
\[ \begin{vmatrix} a_0x_1 + a_1x_2 & a_1x_1 + a_2x_2 \\ a_1x_1 + a_2x_2 & a_2x_1 + a_3x_2 \end{vmatrix} \]
of the cubic $(a_0, a_1, a_2, a_3 \between x_1, x_2)^3$ with the operators
$\Omega - x_2\,\partial/\partial x_1$, $O - x_1\,\partial/\partial x_2$.

2. Apply the corresponding test to the simultaneous invariant
$a_0b_2 - 2a_1b_1 + a_2b_0$ of two binary quadratics.

Again, there are corresponding operators for ternary and
higher categories, and analogous results giving finite algebraically
complete systems, some of which have considerable importance
in other branches of mathematics. Recently Forsyth¹ has
given results which deal with quadratic ground forms involving
one or more homogeneous sets of variables $x_1, x_2, x_3, x_4$. The
solutions so found are the functions needed in formulating the
physical invariants of the Relativity Theory.

¹ Proc. Royal Soc. Edinburgh, 42 (1921-2), 147-212.

Example.

An algebraically complete system is that of five concomitants $f_1$, $f_2$,
$D_{11}$, $D_{12}$, $D_{22}$ of two quadratics (§5, p. 219). For the sixth, $J$, can be expressed
algebraically but irrationally in terms of them as
\[ J = \left(D_{12}f_1f_2 - \tfrac{1}{2}D_{11}f_2^2 - \tfrac{1}{2}D_{22}f_1^2\right)^{1/2} \]
owing to the syzygy
\[ 2J^2 = 2D_{12}f_1f_2 - D_{11}f_2^2 - D_{22}f_1^2 \]
which connects them. Or again, this relation might be used to give one
invariant, $D_{12}$ or $D_{11}$ or $D_{22}$, rationally but not integrally in terms of the
other five forms.

4. Irreducibility. Gordan's Theorem.

The question now arises, are we to break the integrity of our
work by introducing an awkward irrationality by solving this
equation for $J$? The instinct of all the great algebraists of last
century has been to say, No. Far more is gained by retaining
the set of six polynomial functions than by rejecting one of them
at the expense of symmetry. We are here in touch with a big
question, one which is not often explained and reasoned out
in an English elementary textbook. Perhaps our national love
of independence is at work, and we unconsciously admire $p$
things essentially distinct, rather than $p + q$ things dependent
on $q$ binding relations! Anyhow we have to thank Cayley and
Sylvester for their original handling of the dilemma, with their
insistence on maintaining the polynomial character of these
functions at all costs. So we are led, from the algebraically
complete system of concomitants, to the broader conception of
the irreducible system, and with it, Gordan's theorem.

Because every polynomial concomitant of two binary quadratics
can be expressed rationally and integrally in terms of six,
but not of five or less of them, these six are called the irreducible
system of two quadratics. Analogous systems hold for a single
binary cubic, quartic, and $n$-ic.

For a time after Cayley first broached the subject in 1856
the conviction began to gain ground that for values of $n$ greater
than four the system was infinite. Then in 1868 Gordan sprang
his great surprise on the algebraic world by proving that the
irreducible system of a binary $n$-ic is finite; and this in short is
his great theorem. He perfected the proof in three stages, extending
it from the original case of one to any number of binary
ground forms.¹

In 1890 Hilbert² gave an alternative proof applicable to
forms of all categories. This proof, which we shall soon consider 
in detail, consists of two parts, first establishing a remarkable, 
not to say startling, lemma, sometimes called the Basis Theorem, 
of very great generality; and secondly leading by use of the 
Cayley operator to the desired result concerning invariants. 
Hilbert’s lemma is an Existence Theorem: it establishes the 
existence of a certain finite system of functions, but throws no 
light on how to find them. Gordan’s proof, on the other hand, 
actually provides its own solution. Both methods were very great 
achievements. 

¹ Cayley, Second Memoir (1856). Collected Papers, Vol. II, 260-275.
Gordan, Crelle, 69 (1868), 323-354. On p. 343 the author introduces the term
complete system (volles System). Grace and Young, Algebra of Invariants,
pp. 101-127 contain the proof for binary forms, substantially in the form of
Gordan's third proof. For a general survey of the whole problem, see Meyer's
Berichte, pp. 134-150.

² Math. Annalen, 36 (1890).



CHAPTER XV 

The Gordan-Hilbert Finiteness Theorem 

1. Hilbert’s Basis Theorem. 

Let
\[ X_1, X_2, \ldots, X_n \tag{1} \]
be $n$ variables. Further, let there be a given law or specified set
of conditions whereby forms $F$ in these $n$ variables are constructed.
Let this law be such that it leads to an infinite number of such
forms, each involving the variables and not being merely a constant.
We write
\[ F_1, F_2, \ldots, F_m, \ldots \tag{2} \]
to denote the totality $S_\infty$ of these forms $F_i$.

Suppose further that
\[ A_1, A_2, A_3, \ldots \tag{3} \]
denote forms in $X$, which are not necessarily contained in the
system $S_\infty$, but which can if necessary be constants. Then
Hilbert's Basis Theorem runs as follows:

From the infinite set of forms $S_\infty$ a finite set of forms $F_1, F_2, \ldots,
F_m$ can be selected, such that every form $F$ of the system can be
expressed as
\[ F = A_1F_1 + A_2F_2 + \ldots + A_mF_m. \tag{4} \]

These forms $F_1, F_2, \ldots, F_m$ are then said to be the basis of
the system $S_\infty$.

To illustrate this we can take the extreme case when the law
is absolutely general. In this case $F_1, F_2, \ldots, F_m$ are the $n$
variables $X_1, X_2, \ldots, X_n$ themselves. For each of these is a
linear form, and thus falls within the set $S_\infty$; while any other
form $F$ can manifestly be expressed as
\[ A_1X_1 + A_2X_2 + \ldots + A_nX_n, \]
where each $A_i$ is a suitable form of one degree lower than that
of $F$.
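The extreme case just described can be made concrete. In the Python sketch below, which is only an illustration, a form is stored as a dictionary from exponent tuples to coefficients, and the functions `split_by_variables` and `recombine` are names invented here. Each term of $F$ is assigned to the first variable that divides it, which is one of many possible choices of the multipliers $A_i$; the form is assumed to have no constant term.

```python
def split_by_variables(F, n):
    """Return forms A_1..A_n with F = A_1 X_1 + ... + A_n X_n.

    Each monomial of F is divided by the first variable occurring in it,
    so F must not contain a constant term.
    """
    parts = [dict() for _ in range(n)]
    for e, c in F.items():
        k = next(i for i in range(n) if e[i] > 0)
        reduced = e[:k] + (e[k] - 1,) + e[k + 1:]
        parts[k][reduced] = c
    return parts

def recombine(parts):
    """Form the sum A_1 X_1 + ... + A_n X_n from the multipliers A_i."""
    out = {}
    for k, A in enumerate(parts):
        for e, c in A.items():
            full = e[:k] + (e[k] + 1,) + e[k + 1:]
            out[full] = out.get(full, 0) + c
    return {e: c for e, c in out.items() if c != 0}

# F = X1^2 X2 - 4 X2 X3^2 + 7 X3^3, a cubic form in three variables.
F = {(2, 1, 0): 1, (0, 1, 2): -4, (0, 0, 3): 7}
parts = split_by_variables(F, 3)
for k, A in enumerate(parts, start=1):
    print("A_%d =" % k, A)
print(recombine(parts) == F)
```

Every multiplier so obtained is a form of degree one lower than $F$, as the text asserts.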



236 THE GORDAN-HILBERT THEOREM [Chap. 


This basis would fail if the law specifically excluded linear
forms, because none of the $X_i$ would then fall within $S_\infty$.

Proof . — 

We prove the theorem by induction. For if $n = 1$, there
is only one variable $X_1$, and the most general homogeneous
polynomial, or form, $F_i$ is now simply $c_iX_1^{k_i}$. Then the supposed
law will produce various positive integral indices $k_i$, of which
$k_1$, say, is the least or one of the least. Every form is then
expressible as in (4) with $m = 1$, since $X_1^{k_1}$ will now be a factor
of every possible form $F_i$. So the theorem is true if $n = 1$.


Next let us assume it true for $n - 1$ variables $X_1, X_2, \ldots,
X_{n-1}$. Let $F_1$, an arbitrary member of the system $S_\infty$, be of
degree $r$ in the $n$ variables $X_1, X_2, \ldots, X_n$. It is then quite
possible that $F_1$ contains no term involving $X_n^r$, for the
supposed law might expressly exclude this. We can, however,
guarantee that such a term actually occurs, a fact of importance
later in the proof, by the following device.

Through the linear transformation $X \to Y$, or more expressly,
\[ X_i = \sum_j e_{ij}Y_j, \qquad \Delta = |e_{ij}| \neq 0, \tag{5} \]
we change the form $F_1$ into a form $G_1$, of degree $r$ in the variables $Y$,
\[ G_1 = cY_n^r + c_1Y_n^{r-1} + \ldots + c_r \quad (c \neq 0). \tag{6} \]
The constant coefficient $c$ of $Y_n^r$, which alone we need to examine,
is given by substituting the values $X_i = e_{in}$ ($i = 1, 2, \ldots, n$)
in $F_1$ (§2, pp. 183-4); in fact
\[ c = F_1(e_{1n}, e_{2n}, \ldots, e_{nn}), \]
and since $F_1$ is not identically zero, we may choose suitable values
of these coefficients so as to give $c \neq 0$.

Next if we transform members $F_i$ of $S_\infty$ by (5), into forms
$G_1, G_2, \ldots$ in the variables $Y$, we can prove the theorem for the
forms $F$ by showing that it holds for $G$. For if
\[ G = B_1G_1 + B_2G_2 + \ldots + B_mG_m \tag{7} \]
holds for every form $G$ in terms of a basis $G_1, \ldots, G_m$, we merely
have to reverse the transformation $X \to Y$ to obtain the desired
result (4) directly from (7). We therefore proceed to prove the
result (7), taking $G_1$ as given by (6).



To this end let each $G_s$ ($s = 2, 3, \ldots$) be divided by $G_1$
regarded as a function of $Y_n$, giving a quotient $B_s$. Thus
\[ G_s = B_sG_1 + \Phi_sY_n^{r-1} + \Psi_sY_n^{r-2} + \ldots + \Omega_s. \tag{8} \]
Here $B_s$ is identically zero if $G_s$ is of lower degree in $Y_n$ than $G_1$:
also some or all of the polynomials $\Phi_s, \ldots, \Omega_s$ may vanish;
and further, they all may contain the $n - 1$ variables $Y_1, \ldots,
Y_{n-1}$ but not $Y_n$. To such polynomials, formed as they are
by a definite law, the Basis Theorem applies. In particular
for the system $\Phi_2, \Phi_3, \ldots, \Phi_s, \ldots$
we have
\[ \Phi_s = A_{s_1}\Phi_{s_1} + A_{s_2}\Phi_{s_2} + \ldots + A_{s_\nu}\Phi_{s_\nu}, \tag{9} \]
where the suffixes $s_1, \ldots, s_\nu$ are fixed integers.

If we write (8) as
\[ \Phi_sY_n^{r-1} = G_s - B_sG_1 - \Psi_sY_n^{r-2} - \ldots - \Omega_s, \]
and let $s$ take the $\nu$ special values $s_1, s_2, \ldots, s_\nu$, we obtain the
system of equations
\[ \Phi_{s_i}Y_n^{r-1} = G_{s_i} - B_{s_i}G_1 - \Psi_{s_i}Y_n^{r-2} - \ldots - \Omega_{s_i} \quad (i = 1, 2, \ldots, \nu). \tag{10} \]
Multiplying these respectively by $A_{s_1}, \ldots, A_{s_\nu}$ and adding,
we obtain, by (9),
\[ \Phi_sY_n^{r-1} = A_{s_1}G_{s_1} + \ldots + A_{s_\nu}G_{s_\nu} + CG_1 + \Psi_s'Y_n^{r-2} + \ldots + \Omega_s', \tag{11} \]
where the significance of $C, \Psi_s', \ldots, \Omega_s'$ is easily seen. Whence
by substituting this value of $\Phi_sY_n^{r-1}$ in (8), we have, say,
\[ \begin{aligned} G_s &= B_s'G_1 + A_{s_1}G_{s_1} + \ldots + A_{s_\nu}G_{s_\nu} + \Psi_s''Y_n^{r-2} + \ldots + \Omega_s'' \\ &= \sum_i B_iG_i + \Psi_s''Y_n^{r-2} + \ldots + \Omega_s'' \\ &= \sum_i B_iG_i + U_{r-2}, \end{aligned} \tag{12} \]
where $i$ takes $\nu + 1$ values, $1, s_1, s_2, \ldots, s_\nu$. In the same
way (8) may be written $G_s = B_sG_1 + U_{r-1}$, where $U_{r-1}$, $U_{r-2}$ are
polynomials of degree in $Y_n$ at most indicated by their suffixes.
Proceeding again in this way as from (8) to (12), we deduce an
expression for $G_s$ involving $U_{r-3}$, of degree at most $r - 3$. This
procedure, in at most $(r - 1)$ steps, gives $U_0$, a polynomial
independent of $Y_n$ and therefore one for which the basis
theorem is true. So finally we express $G_s$ as
\[ G_s = B_1G_1 + B_2G_2 + \ldots + B_mG_m, \]
where $m$ is finite. This proves the theorem.

The remarkable point about this theorem is its breadth:
for provided the law of formation of the functions $F_i$ is definite
it may be as intricate a law as we please. For instance, the
variables $X_1, X_2, \ldots, X_n$ may be replaced by any finite set of
variables, such as the cogredient and contragredient and intermediate
compound sets $x$, $u$, $p_{ij}$, $\pi^{ij}$, &c., already adopted.

Further, the proof is applicable not merely to the field of
ordinary complex numbers, but to any more restricted field of
number where the law of division as required in (8), together
with laws of addition and multiplication for constructing polynomials,
still hold.¹

2. Proof of Gordan's Theorem.

We may now formally prove the Gordan-Hilbert Finiteness 
Theorem, which runs as follows: 

For any finite given set of ground forms, every rational integral
concomitant of a general linear transformation can be expressed
rationally and integrally in terms of a finite number of concomitants
$C_1, C_2, \ldots, C_m$. These $m$ concomitants are said to form the complete
system for the given ground forms.

Proof.

By adjoining certain linear forms to the given ground forms 
every concomitant of the original system is determined by the 
invariants of this extended system (§8, p. 207). We therefore 
need consider invariants only, these being functions of the 
coefficients of the ground forms. 

¹ The above is substantially the original proof given by Hilbert. An even
neater proof was given later by Gordan (Göttinger Nachrichten (1899), 240-242).
Cf. Grace and Young, Invariants, pp. 178-182.

Since homogeneous invariants represent all polynomial invariants
(§3, p. 171), while the multiplication of a given invariant
throughout by a non-zero constant does not effectively change
it, we may consider all invariants as forms in the coefficients
of the ground forms, derived therefrom by definite laws. Thus
the Hilbert Basis Theorem applies to any invariant $I$; namely
\[ I = A_1I_1 + A_2I_2 + \ldots + A_mI_m, \tag{13} \]
where $I_1, I_2, \ldots, I_m$ is a fixed set of invariants, and each $A_i$ is
a polynomial in the coefficients of the ground forms, but not
necessarily an invariant.

Now let the linear transformation of variables change $A_k$ to
$A_k'$, $I_k$ to $(\xi\eta\ldots\omega)^{w_k}I_k$, and $I$ to $(\xi\eta\ldots\omega)^wI$. Then the $k$th
term on the right of (13) is transformed to a polynomial in
$\xi_1, \ldots, \omega_n$ of weight $w$, in order to agree with the left. Further
it breaks up into, say, $r$ single terms (if $A_k$ has $r$ terms), each of
which has the factor $I_k$, independent of $\xi_1, \ldots, \omega_n$. All other
factors can be symbolized by inner and outer products $a_\xi, \ldots,
(a\eta\ldots\omega), \ldots$, with a possible common denominator $(\xi\eta\ldots\omega)^p$,
exactly as in §7, p. 205. We multiply throughout by this
denominator, so that (13) is replaced by an identity
\[ (\xi\eta\ldots\omega)^{w+p}I = \phi_1I_1 + \phi_2I_2 + \ldots + \phi_mI_m, \qquad w + p > 0, \]
where each $\phi_k$ is a polynomial in these inner and outer products.

Operating with the Cayley operator $\Omega^{w+p}$ on both sides as in §7, p. 206, we obtain
\[ \lambda I = I_1'I_1 + I_2'I_2 + \ldots + I_m'I_m \quad (\lambda \neq 0), \tag{14} \]
where the $k$th term $I_k'I_k$ is due to the $k$th term of the original
series, $I_k'$ being either an invariant or zero.

Since $I_k$ has degree greater than zero, $I_k'$ has degree less than
that of $I$. Thus every invariant is expressible polynomially in
terms of $I_1, \ldots, I_m$, together with invariants $I_k'$ of lower degree.
Treating each $I_k'$ by the same process as for $I$ itself, we lower the
degree of these additional invariants at each stage. Since the
degree of $I$ is finite, a finite number of such processes ultimately
furnishes mere constants, apart from the system $I_1, \ldots, I_m$. So
we have expressed $I$ explicitly as a polynomial in $I_1, \ldots, I_m$,
and this proves the theorem. A proof without the help of
symbolic methods can also be given.¹

¹ Cf. Weitzenböck, Invariantentheorie (1923), 145-148.

3. Limit to the Number of Syzygies.

The distinction between an algebraically complete and an
irreducibly complete system for given ground forms should now
be clear. Manifestly the latter is richer and more elaborate
than the former, as, for instance, in the case of two binary quadratics
(10), p. 220, where a syzygy connects the six irreducibles
so as to make five algebraically independent. Evidently, too,
there will be such syzygies in general between members of the
irreducible system.

It is an interesting example of the Basis Theorem to prove
that for a given complete system of $m$ forms
\[ I_1, I_2, \ldots, I_m, \]
the number of polynomial syzygies is limited.

For let $G(I) = 0$ be called a syzygy of the first kind if $G(I)$
is a polynomial in these $I$'s which does not vanish identically,
and which only involves coefficients of the ground forms implicitly
among these $I$'s, and which vanishes when each $I$ is expanded
as a polynomial in these coefficients.

Then $G(I)$ can if necessary be made homogeneous by adjoining
another $I_0$ which later can be made equal to unity. To such
functions, built by definite laws, the Basis Theorem applies.
Hence every $G$ is expressible as
\[ G(I) = A_1G_1(I) + \ldots + A_\mu G_\mu(I) \]
in terms of a finite number of such functions.

4. Multiple Fields. 

Hilbert has pointed out the applicability of his methods to 
the general theory of forms when the variables fall into quite 
distinct fields governed by independent linear transformations. 
The following example¹ sufficiently illustrates the method, which
may readily be generalized.

¹ Cf. Weitzenböck, Invariantentheorie, p. 169.

Let
\[ F = A_X^p a_x^q \tag{15} \]
be a form of orders $(p, q)$ in two independent sets of variables
$X$ and $x$:
\[ X_1, X_2, \ldots, X_m; \qquad x_1, x_2, \ldots, x_n, \tag{16} \]
where $X$ belongs to a field of order $m$ and $x$ to one of order $n$.
Here $m$ and $n$ may be equal or not as we prefer.

The symbolic coefficient of a typical term is
\[ A_iA_j \ldots a_ra_s \ldots \tag{17} \]
involving $p$ factors with a capital $A$ and $q$ with a small $a$. The
actual coefficient could be represented by
\[ a_{ij\ldots,\,rs\ldots} \tag{18} \]
without recourse to upper suffixes, since the variables $X$, $x$ are
not assumed to be contragredient.

For such forms we require two independent linear transformations
$X \to X'$ and $x \to x'$, associated with which there
will be polynomial invariants composed of coefficients $a_{ij\ldots,\,rs\ldots}$.
If
\[ X_i = \sum_j E_{ij}X_j', \qquad x_i = \sum_j e_{ij}x_j' \tag{19} \]
are the linear transformations, with moduli
\[ \Delta = |E_{ij}| \neq 0, \qquad \delta = |e_{ij}| \neq 0, \tag{20} \]
then as in §2, p. 170, an invariant satisfies the condition of
transformation
\[ I' = \Delta^W\delta^wI, \tag{21} \]
where $W$, $w$, positive integers, are called the weights of $I$ in the
coefficients of the independent sets of variables.

For such polynomials in coefficients (18) the two Cayley
operators
\[ \Omega_E = \left|\frac{\partial}{\partial E_{ij}}\right|, \qquad \Omega_e = \left|\frac{\partial}{\partial e_{ij}}\right| \tag{22} \]
have place. A symbolic polynomial $P$ in the symbols $A$, $a$, which
contains after transformation a weight $W$ of the $E_{ij}$ and $w$ of
the $e_{ij}$ is such that
\[ \Omega_E^W\Omega_e^wP = 0 \quad \text{or} \quad = I, \tag{23} \]
an invariant. Hence as in the proof of the First Fundamental
Theorem an invariant is symbolized by aggregates of two sorts
of bracket factors
\[ (AB \ldots M), \qquad (ab \ldots n), \tag{24} \]
which are symbolic determinants of orders $m$ and $n$ respectively.
A covariant will also have factors $A_X$, $a_x$.

Further, the Hilbert Basis Lemma applies; and, for a polynomial
invariant we have
\[ I = A_1I_1 + A_2I_2 + \ldots + A_\mu I_\mu \]
in terms of a properly chosen finite set of invariants $I_1, \ldots, I_\mu$.
Finally the operator $\Omega_E^W\Omega_e^w$, acting upon this, leads as before
to the expression of all invariants in terms of a complete system
consisting of these invariants $I_1, I_2, \ldots, I_\mu$.


5. Combinants. 


In a multiple field let $F_\lambda$ be a form linear in one set of variables
$\lambda_1, \lambda_2, \ldots, \lambda_m$. Then to take the case of two sets of variables
$\lambda$, $x$, the typical form has orders $(1, p)$.
It is simplest to express this explicitly as a sum of $m$ terms
\[ \lambda_1f_1 + \lambda_2f_2 + \ldots + \lambda_mf_m, \]
where each $f_i$ is a form of order $p$ in the variables $x$. Any concomitant
of the multiple form $F_\lambda$ is called a combinant of the $m$
forms $f_i$.

If the independent transformations $\lambda \to \lambda'$, $x \to x'$ are made
in the particular case when $\lambda' = \lambda$, then $x$ alone changes, and
the concomitant $C$ is therefore invariantive for the form $F_\lambda$
regarded as a function of the $x$'s alone. Hence it is an invariant
of the simultaneous system $f_1, f_2, \ldots, f_m$. Every combinant is
therefore a concomitant of the forms $f_i$, but the converse is not
true. Indeed it is an important problem for a given set of $p$-ics
$f_i$ to determine which of their concomitants are combinants.

Similar remarks apply to cases with more sets of variables
$\lambda, x, y, \ldots$

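Example.

The Jacobian of two binary forms
\[ f_1 = a_x^p, \qquad f_2 = b_x^p \]
is a combinant, since
\[ \frac{\partial(\lambda_1f_1 + \lambda_2f_2,\ \mu_1f_1 + \mu_2f_2)}{\partial(x_1, x_2)} = \begin{vmatrix} \lambda_1 & \lambda_2 \\ \mu_1 & \mu_2 \end{vmatrix} \frac{\partial(f_1, f_2)}{\partial(x_1, x_2)} = p^2(\lambda\mu)(ab)a_x^{p-1}b_x^{p-1}. \]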
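The combinant property of the Jacobian may be tried on a numerical instance. The Python sketch below is an illustration only: binary forms are dictionaries from exponent pairs $(i, j)$, standing for $x_1^ix_2^j$, to coefficients, and the helpers `add`, `scale`, `mul`, `diff` and `jacobian`, together with the sample forms and multipliers, are assumptions made here rather than anything taken from the text.

```python
def add(*polys):
    """Sum of polynomials, with vanishing terms dropped."""
    out = {}
    for p in polys:
        for e, c in p.items():
            out[e] = out.get(e, 0) + c
    return {e: c for e, c in out.items() if c != 0}

def scale(k, p):
    """Product of the polynomial p by the number k."""
    return {e: k * c for e, c in p.items() if k * c != 0}

def mul(p, q):
    """Product of two polynomials."""
    out = {}
    for e1, c1 in p.items():
        for e2, c2 in q.items():
            e = tuple(i + j for i, j in zip(e1, e2))
            out[e] = out.get(e, 0) + c1 * c2
    return {e: c for e, c in out.items() if c != 0}

def diff(p, k):
    """Partial derivative with respect to the k-th variable (k = 0 or 1)."""
    out = {}
    for e, c in p.items():
        if e[k]:
            d = e[:k] + (e[k] - 1,) + e[k + 1:]
            out[d] = out.get(d, 0) + c * e[k]
    return out

def jacobian(f, g):
    """Functional determinant d(f, g)/d(x1, x2)."""
    return add(mul(diff(f, 0), diff(g, 1)),
               scale(-1, mul(diff(f, 1), diff(g, 0))))

f1 = {(3, 0): 1, (1, 2): 2}        # x1^3 + 2 x1 x2^2
f2 = {(2, 1): 3, (0, 3): -1}       # 3 x1^2 x2 - x2^3
lam, mu = (2, 5), (-1, 3)          # sample multipliers lambda, mu

left = jacobian(add(scale(lam[0], f1), scale(lam[1], f2)),
                add(scale(mu[0], f1), scale(mu[1], f2)))
right = scale(lam[0] * mu[1] - lam[1] * mu[0], jacobian(f1, f2))
print(left == right)
```

The factor `lam[0] * mu[1] - lam[1] * mu[0]` is the determinant $(\lambda\mu)$ of the example; the check succeeds for any choice of the sample data because the functional determinant is bilinear and alternating in its two arguments.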

The general properties of binary combinants were given by Gordan, Math.
Annalen, 5 (1872), p. 96. Further references are to be found in Meyer's Berichte,
and the same author's book, Apolarität und rationale Curven. Much interesting
information will also be found in Chapters XI and XIV, Grace and Young,
Algebra of Invariants.



EXAMPLES¹

1. Extend the methods of §6, p. 173, to show that a double binary
form can be symbolized as follows:
\[ \sum_{i,j}\binom{m}{i}\binom{n}{j}a_{ij}\,x_1^{m-i}x_2^i\,y_1^{n-j}y_2^j = a_x^m\alpha_y^n, \]
so that $a_{ij} = a_1^{m-i}a_2^i\,\alpha_1^{n-j}\alpha_2^j$.

2. If this form is written $F = a_x^m\alpha_y^n = b_x^m\beta_y^n$, prove that
\[ (ab)(\alpha\beta)a_x^{m-1}b_x^{m-1}\alpha_y^{n-1}\beta_y^{n-1} \]
is a covariant for independent linear transformation $x \to x'$, $y \to y'$.

3. If $z = \xi + i\eta$, $z' = \xi - i\eta$ are conjugate complex numbers, where
$\xi$, $\eta$ are rectangular Cartesian co-ordinates of a point, then the equation
of a circle can be written
\[ azz' + bz + b'z' + d = 0. \]
Determine the conditions for the equation in $\xi$, $\eta$ to have real coefficients.

[Coefficients $a$, $d$ real; $b$, $b'$ conjugate complex numbers.]

4. If, further, $z = x_1 : x_2$, $z' = y_1 : y_2$, $m = n$, then the equation
$F = 0$, expressed in terms of $\xi$, $\eta$, represents a plane curve of order $2m$,
in general with multiple points of order $m$ at the circular points at
infinity.

[Terms of highest degree are $(\xi^2 + \eta^2)^m$.]

5. Prove that the linear transformation $w = (pz + q)/(rz + s)$, where
$ps - qr \neq 0$, is equivalent to a particular case of the two transformations
$x \to x'$, $y \to y'$ above; and that this $z$ transformation is equivalent to
inversion successively in circles orthogonal to each other.

6. If the bilinear form ($m = n = 1$) represents a circle, prove that it
degenerates to a point circle when $(ab)(\alpha\beta) = 0$.

7. Prove that if $a, \alpha$; $b, \beta$; $c, \gamma$; $d, \delta$ are pairs of symbols not necessarily
equivalent, the product of two covariants
\[ (ab)(\alpha\beta)a_x^{m-1}b_x^{m-1}\alpha_y^{n-1}\beta_y^{n-1}, \qquad (cd)(\gamma\delta)c_x^{m-1}d_x^{m-1}\gamma_y^{n-1}\delta_y^{n-1} \]
is reducible.

¹ References to double binary forms: Peano first gave the system for
bilinear forms ($m = n = 1$); Battaglini, 20 (1882). For a more direct proof cf.
Proc. Roy. Soc. Edinburgh, 43 (1922-3), 43-50 (45). The general theory is given
by Kasner, Trans. American Math. Soc., 1 (1900): "The invariant theory of
the inversion group"; also 4 (1903).

Peano gave the 18 concomitants of the complete system of the (2, 2) form.
For their geometrical treatment cf. Turnbull, Proc. Edinburgh Roy. Soc., 44
(1923-4), 23-50, where other references are given; and Vaidyanathaswamy,
Proc. London Math. Soc., 2, 24 (1925), 83-102, "On the rank of the double
binary form".

The (2, 1) form has been treated by these authors in the works quoted,
while the system for two (2, 1) forms is given by Saddler, Proc. Edinburgh Roy.
Soc., 45 (1924-6), 3-13; cf. also 46 (1925-6), 264-282. The same author gives
the system of the (1, 1, 1) form in Proc. Cambridge Phil. Soc., 22 (1923-6),
688-693: cf. also Schwartz, Math. Zeitschrift, 12 (1922), 18-35.

For a proof of Gordan's theorem and a general transvectant method of
discussing the double binary forms, cf. Proc. Edinburgh Math. Soc., 41 (1922-3),
116-127; cf. also Gordan, Math. Annalen, 38 (1889), 387-389; Study, Math.
Annalen, 27 (1886); Lehnen, Dissertation Bonn (1921). Double and multiple
binary perpetuants are considered by the author in Proc. London Math. Soc.,
2, 27 (1928), 193-208.

6. Further Examples of Complete Systems. The Binary Cubic. 

In binary forms the complete system of a single $n$-ic includes
invariants and covariants. For a linear form ($n = 1$) the Fundamental
Theorem shows that no concomitant exists beyond the
form itself. A binary quadratic $f = a_x^2$ has a complete system
of two forms: $f$ and its discriminant. That of a binary cubic
\[ f = a_x^3 = b_x^3 = a_0x_1^3 + 3a_1x_1^2x_2 + 3a_2x_1x_2^2 + a_3x_2^3 \tag{25} \]
has been established in various ways. It consists of four forms
\[ f = a_x^3, \quad H = (ab)^2a_xb_x, \quad t = (ab)^2(ca)b_xc_x^2, \quad \Delta = (ab)^2(ac)(bd)(cd)^2. \tag{26} \]

Here $H$ is the Hessian of the cubic ground form $f$; $t$ is the Jacobian
of $f$ and $H$; and $\Delta$ is the single invariant, the discriminant of the
quadratic $H$.

Non-symbolically we find
\[ H = 2\begin{vmatrix} a_0x_1 + a_1x_2 & a_1x_1 + a_2x_2 \\ a_1x_1 + a_2x_2 & a_2x_1 + a_3x_2 \end{vmatrix}, \tag{27} \]
\[ t = (f, H) = \frac{1}{2}\begin{vmatrix} a_0x_1^2 + 2a_1x_1x_2 + a_2x_2^2 & a_1x_1^2 + 2a_2x_1x_2 + a_3x_2^2 \\ \dfrac{\partial H}{\partial x_1} & \dfrac{\partial H}{\partial x_2} \end{vmatrix}, \tag{28} \]
\[ \Delta = 2\{4(a_0a_2 - a_1^2)(a_1a_3 - a_2^2) - (a_0a_3 - a_1a_2)^2\}. \tag{29} \]

The leading term in $t$ is
\[ (a_0^2a_3 - 3a_0a_1a_2 + 2a_1^3)x_1^3, \]
which contains a seminvariant of degree and weight 3 for its
coefficient.

These four forms $f$, $H$, $t$, $\Delta$ are irreducible but not algebraically
independent, for they are connected by the syzygy
\[ 2t^2 + H^3 + \Delta f^2 = 0, \tag{30} \]
which is a further example of the general fact that the square
of a Jacobian is reducible. This syzygy is also deducible by
eliminating $x_1$, $x_2$ from the three equations
\[ f = a_0x_1^3 + \ldots, \quad H = 2(a_0a_2 - a_1^2)x_1^2 + \ldots, \quad t = (a_0^2a_3 - 3a_0a_1a_2 + 2a_1^3)x_1^3 + \ldots, \tag{31} \]
which is an example showing that an invariant can often be
looked upon as the result of eliminating $p$ unknowns from
$p + 1$ equations.

EXAMPLE

If $M$ is the modulus of the transformation $x_1, x_2 \to X, Y$, where $X$, $Y$
are the linear factors of the Hessian, show that this system for the cubic
can be written
\[ f = X^3 + Y^3, \quad H = 2M^2XY, \quad t = M^3(X^3 - Y^3), \quad \Delta = -2M^6, \]
and verify the syzygy.
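The syzygy (30) can also be verified non-symbolically for any particular cubic by direct expansion of (27), (28), (29). The Python sketch that follows is meant only as such a check: binary forms are dictionaries from exponent pairs to coefficients, exact rational arithmetic is used for the factor $\tfrac{1}{2}$ in (28), and all function names (`add`, `scale`, `mul`, `diff`, `cubic_system`, `syzygy`) are assumptions introduced for the illustration.

```python
from fractions import Fraction

def add(*polys):
    out = {}
    for p in polys:
        for e, c in p.items():
            out[e] = out.get(e, 0) + c
    return {e: c for e, c in out.items() if c != 0}

def scale(k, p):
    return {e: k * c for e, c in p.items() if k * c != 0}

def mul(*polys):
    out = polys[0]
    for q in polys[1:]:
        nxt = {}
        for e1, c1 in out.items():
            for e2, c2 in q.items():
                e = (e1[0] + e2[0], e1[1] + e2[1])
                nxt[e] = nxt.get(e, 0) + c1 * c2
        out = {e: c for e, c in nxt.items() if c != 0}
    return out

def diff(p, k):
    out = {}
    for e, c in p.items():
        if e[k]:
            d = e[:k] + (e[k] - 1,) + e[k + 1:]
            out[d] = out.get(d, 0) + c * e[k]
    return out

def form(terms):
    """Drop zero coefficients from a literal dictionary of terms."""
    return {e: c for e, c in terms.items() if c != 0}

def cubic_system(a0, a1, a2, a3):
    """f, H, t, Delta as in (25), (27), (28), (29)."""
    lin = lambda u, v: form({(1, 0): u, (0, 1): v})
    quad = lambda u, v, w: form({(2, 0): u, (1, 1): 2 * v, (0, 2): w})
    f = form({(3, 0): a0, (2, 1): 3 * a1, (1, 2): 3 * a2, (0, 3): a3})
    H = scale(2, add(mul(lin(a0, a1), lin(a2, a3)),
                     scale(-1, mul(lin(a1, a2), lin(a1, a2)))))
    t = scale(Fraction(1, 2),
              add(mul(quad(a0, a1, a2), diff(H, 1)),
                  scale(-1, mul(quad(a1, a2, a3), diff(H, 0)))))
    delta = 2 * (4 * (a0 * a2 - a1 ** 2) * (a1 * a3 - a2 ** 2)
                 - (a0 * a3 - a1 * a2) ** 2)
    return f, H, t, delta

def syzygy(a0, a1, a2, a3):
    """Left-hand side of (30): 2 t^2 + H^3 + Delta f^2."""
    f, H, t, delta = cubic_system(a0, a1, a2, a3)
    return add(scale(2, mul(t, t)), mul(H, H, H), scale(delta, mul(f, f)))

for coeffs in [(1, 0, 0, 1), (1, 1, 0, 1), (2, -1, 3, 5)]:
    print(coeffs, syzygy(*coeffs) == {})
```

The first sample is the canonical form $X^3 + Y^3$ of the Example with $M = 1$; the others are arbitrary integer cubics.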

7. The Binary Quartic Form. 

The complete system of a binary quartic form 

$f = a_x^4 = a_0x_1^4 + 4a_1x_1^3x_2 + 6a_2x_1^2x_2^2 + 4a_3x_1x_2^3 + a_4x_2^4$ (32)

consists of five concomitants

$f = a_x^4$, $H = (f, f)^2 = (ab)^2 a_x^2 b_x^2$, $t = (f, H) = (ab)^2 (ca)\, a_x b_x^2 c_x^3$,
$i = (ab)^4$, $j = (bc)^2 (ca)^2 (ab)^2$, (33)

three being covariants and two invariants. 

Non-symbolically the invariants are 

$i = 2(a_0a_4 - 4a_1a_3 + 3a_2^2)$,

$j = 6 \begin{vmatrix} a_0 & a_1 & a_2 \\ a_1 & a_2 & a_3 \\ a_2 & a_3 & a_4 \end{vmatrix} = 6(a_0a_2a_4 - a_0a_3^2 - a_1^2a_4 - a_2^3 + 2a_1a_2a_3)$. (34)

The Hessian $H$ is a quartic of degree two in the coefficients
of the ground form, while $t$ is a sextic. These two covariants
are evidently analogous to those of the previous cubic ground
form, as the symbolic expression shows. Corresponding covariants
naturally exist for the general $n$-ic. The invariants
$i$ and $j$ appear for the first time because they involve four symbols
$a$ in their bracket factors. They lead to analogous covariants
of a quintic

$(ab)^4 a_x b_x$, $(bc)^2 (ca)^2 (ab)^2 a_x b_x c_x$,

and for higher forms.

Between these five concomitants of a quartic a syzygy exists

$2t^2 + H^3 - \tfrac12 iHf^2 + \tfrac13 jf^3 = 0$, (35)

again because the square of the Jacobian $t$ is reducible. This
may be verified by applying the general theorem, or by use of
a canonical form, say

$f = X^4 + 6mX^2Y^2 + Y^4$. (36)
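A numerical check of the quartic syzygy may be sketched as follows. This is an illustration under stated assumptions, not the author's method: it takes $H = (f, f)^2$ as twice the Hessian determinant of $f$ divided by $144$, $t = \tfrac{1}{16}(f_{x_1}H_{x_2} - f_{x_2}H_{x_1})$, the invariants $i$, $j$ as in (34), and the syzygy in the form $2t^2 + H^3 - \tfrac12 iHf^2 + \tfrac13 jf^3 = 0$. The function names are hypothetical.

```python
from fractions import Fraction


def quartic_system(a, x, y):
    """Evaluate f, H, t, i, j of the binary quartic (a0,...,a4) at the point (x, y)."""
    a0, a1, a2, a3, a4 = a
    f = a0 * x**4 + 4 * a1 * x**3 * y + 6 * a2 * x**2 * y**2 + 4 * a3 * x * y**3 + a4 * y**4
    # Second partial derivatives of f, each divided by 12.
    A = a0 * x * x + 2 * a1 * x * y + a2 * y * y
    B = a1 * x * x + 2 * a2 * x * y + a3 * y * y
    C = a2 * x * x + 2 * a3 * x * y + a4 * y * y
    H = 2 * (A * C - B * B)
    # First partial derivatives of A, B, C, then of H by the product rule.
    Ax, Ay = 2 * (a0 * x + a1 * y), 2 * (a1 * x + a2 * y)
    Bx, By = 2 * (a1 * x + a2 * y), 2 * (a2 * x + a3 * y)
    Cx, Cy = 2 * (a2 * x + a3 * y), 2 * (a3 * x + a4 * y)
    Hx = 2 * (Ax * C + A * Cx - 2 * B * Bx)
    Hy = 2 * (Ay * C + A * Cy - 2 * B * By)
    # One quarter of the first partial derivatives of f.
    p = a0 * x**3 + 3 * a1 * x * x * y + 3 * a2 * x * y * y + a3 * y**3
    q = a1 * x**3 + 3 * a2 * x * x * y + 3 * a3 * x * y * y + a4 * y**3
    t = Fraction(p * Hy - q * Hx, 4)  # equals (f_x H_y - f_y H_x) / 16
    i = 2 * (a0 * a4 - 4 * a1 * a3 + 3 * a2**2)
    j = 6 * (a0 * a2 * a4 - a0 * a3**2 - a1**2 * a4 - a2**3 + 2 * a1 * a2 * a3)
    return f, H, t, i, j


def quartic_syzygy_residual(a, x, y):
    """Left member of the assumed syzygy; it should vanish identically."""
    f, H, t, i, j = quartic_system(a, x, y)
    return 2 * t**2 + H**3 - Fraction(1, 2) * i * H * f**2 + Fraction(1, 3) * j * f**3


# Canonical form X^4 + 6m X^2 Y^2 + Y^4 with m = 2, and two general quartics.
for a in [(1, 0, 2, 0, 1), (1, 1, 0, 0, 0), (2, -1, 3, 5, -4)]:
    for x in range(-2, 3):
        for y in range(-2, 3):
            assert quartic_syzygy_residual(a, x, y) == 0
```

Exact rational arithmetic is chosen so that the fractions $\tfrac12$ and $\tfrac13$ in the syzygy introduce no rounding.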

EXAMPLE 

Assuming a linear transformation $x \to X$ of modulus $M \ne 0$ gives
$a_0x_1^4 + \ldots + a_4x_2^4 = f = X^4 + 6mX^2Y^2 + Y^4$, prove that

$H = 2M^2\{m(X^4 + Y^4) + (1 - 3m^2)X^2Y^2\}$,

$t = M^3(1 - 9m^2)\,XY(X^4 - Y^4)$,
$i = 2M^4(1 + 3m^2)$, $j = 6M^6(m - m^3)$,

and verify the syzygy (35).

8. References to Complete Systems. 

For canonical forms when the cubic or quartic ground form is special,
the reader should consult Elliott’s Algebra of Quantics. The corresponding
symbolic forms are given by Grace and Young in The Algebra of Invariants,
where also an account of the complete systems of the binary quintic,
the sextic, two cubics, quadratic and cubic, quadratic and quartic, and
also of two ternary quadratics, will be found.

The septimic and octavic were worked out by v. Gall, Math. Annalen,
31 (1888), p. 318 and 17 (1881), p. 31, p. 139.

All these results beyond the quartic case are very complicated. 
There are, for example, 23 irreducible concomitants of the binary 
quintic. This increase of complexity is not entirely due to the
increase in number of coefficients of the ground form as its order
advances, for certain concomitants actually are reducible for
higher orders, even when irreducible for lower. Thus the
invariant

$\Delta = (ab)^2 (ac)(bd)(cd)^2$

of a cubic is irreducible, whereas the corresponding covariant

$\Delta' = (ab)^2 (ac)(bd)(cd)^2\, a_x b_x c_x d_x$

of a quartic can in fact be reduced to a linear combination of
$jf$ and $iH$.

There is a theory of perpetuants which deals with covariants 
of a given degree for forms of order not less than the weight of 
any such covariant. It may be regarded as the theory of binary 
forms of infinite order. It affords a notable example of the 
value of both symbolic and non-symbolic methods of attack. 
For the complete system of such forms may be said to be known.
It will be seen from the examples of symbolic methods in 
§3, p. 216, that any such system so found is comprehensive: 
all irreducible forms are certainly included. But it may contain 
redundant forms. Now the non-symbolic methods proceed in 
just the converse way, and show that any system so found con- 
tains no reducible terms. When the two methods yield the same 
result, as in the case of the binary quintic or binary perpetuants, 
they therefore confirm each other. 

In higher fields complete systems are known in certain cases,
but apart from linear and quadratic cases the only complete
ternary system actually computed is that of the cubic by Clebsch
and Gordan (Math. Annalen, 6 (1875), 436). The ternary quartic
has received much attention but still remains unworkable.¹ The
problem of ternary perpetuants was solved by Dr. A. Young
(Proc. London Math. Soc., 2, 22 (1922-3), 171-220).

¹ A notable instalment was worked by Fräulein E. Noether, Crelle, 134
(1908), 23-94.



CHAPTER XVI 


Clebsch’s Theorem 

1. Introduction of Clebsch’s Theorem.

The object of the present chapter is to develop the general 
invariant theory as far as the variables are concerned, and the 
principal result will be a theorem due to Clebsch which tells 
us that a completely adequate account of concomitants in the 
field of order n can be given by restricting the choice of ground 
forms to functions of at most n — 1 sets of cogredient variables. 
All other sets which enter can be accounted for by polarization, 
or by the absolute concomitant of the field. 

For example, in the binary field ($n = 2$) the bilinear form
$a_xb_y$ may be written

$a_xb_y = \tfrac12(a_xb_y + a_yb_x) + \tfrac12(a_xb_y - a_yb_x) = \tfrac12(a_xb_y + a_yb_x) + \tfrac12(ab)(xy)$. (1)

Here the first term is a polar of $a_xb_x$ which contains only one
variable, and the second is a product of an invariant $(ab)$ and
the absolute covariant $(xy)$. Further, this invariant belongs to
the ground form without the need of the second variable $y$.
On the other hand, in the ternary or any higher field ($n > 2$)
we obtain

$a_xb_y = \tfrac12(a_xb_y + a_yb_x) + \tfrac12(ab|xy)$, (2)

where now the second term is irreducible, and involves a new
type of variable $(xy)_{ij}$ which cannot be overlooked; nor could
it arise if merely one set of variables $x$ was utilized. Also the
function $(ab|xy)$ is not a polar, but, rather, satisfies the differential
equation

$\left(x \frac{\partial}{\partial y}\right)(ab|xy) = 0$, (3)

as is immediately apparent since

$\left(x \frac{\partial}{\partial y}\right)(ab|xy) = (ab|xx) = 0$. (4)
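The splitting of the bilinear form $a_xb_y$ into a polar and a compound term can be tested numerically in any field. The following is a minimal sketch, assuming that $(ab|xy)$ stands for the sum $\Sigma_{i<j}(a_ib_j - a_jb_i)(x_iy_j - x_jy_i)$; the helper names are hypothetical.

```python
from fractions import Fraction


def linear(a, x):
    """The linear form a_x = a_1 x_1 + ... + a_n x_n."""
    return sum(ai * xi for ai, xi in zip(a, x))


def compound_inner(a, b, x, y):
    """Assumed meaning of (ab|xy): sum over i<j of (a_i b_j - a_j b_i)(x_i y_j - x_j y_i)."""
    n = len(a)
    return sum(
        (a[i] * b[j] - a[j] * b[i]) * (x[i] * y[j] - x[j] * y[i])
        for i in range(n)
        for j in range(i + 1, n)
    )


def split_bilinear(a, b, x, y):
    """Return the polar part and the compound part of a_x b_y."""
    polar = Fraction(linear(a, x) * linear(b, y) + linear(a, y) * linear(b, x), 2)
    compound = Fraction(compound_inner(a, b, x, y), 2)
    return polar, compound


# Ternary check: the two parts add up to a_x b_y.
a, b, x, y = (1, 2, 3), (4, -1, 2), (2, 0, 5), (-3, 1, 1)
polar, compound = split_bilinear(a, b, x, y)
assert polar + compound == linear(a, x) * linear(b, y)

# Putting y = x kills the compound term, since (ab|xx) = 0.
assert compound_inner(a, b, x, x) == 0
```

In the binary field the same function reduces to the product $(ab)(xy)$ of two determinants, which is the reducible case described in the text.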


2. Compound Polars. Standard Forms. 

Suppose we have any number of point co-ordinates $x, y, z, \ldots, s, \ldots$
from which the various compound co-ordinates
$\pi_2, \pi_3, \ldots$ are derived as in §8, p. 86. We take

$\pi_2 = xy$, $\pi_3 = xyz$, $\ldots$, $\pi_{n-1} = xyz \ldots s$

for the second, third, $\ldots$, $(n - 1)$th compounds, the last representing
a set of $n$ homogeneous prime co-ordinates $u$.

Manifestly a form involving these sets $x, \pi_2, \ldots, \pi_{n-1}$ as
sole variables is a polynomial function of the $n - 1$ cogredient
variables $x, y, z, \ldots, s$, so that we can write it

$F(x, \pi_2, \pi_3, \ldots, \pi_{n-1}) = f(x, y, z, \ldots, s)$.


But the converse is not necessarily true, as the single example
already cited shows. Yet if we introduce any number of cogredient
compound variables $\rho_2, \ldots, \rho_{n-1}$ and polarize each
such form $F$ in every possible way, we obtain a wider choice of
forms, which may be typified by $\Delta F$, such that every form
$f(x, y, \ldots)$ can be expressed in terms of these $\Delta F$: and this in
fact is Clebsch’s theorem.

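By such polarization is meant an operation on $F$ of one of
the following types:

$\left(y \frac{\partial}{\partial x}\right)$, $\left(\rho_2 \frac{\partial}{\partial \pi_2}\right)$, $\ldots$, $\left(\rho_r \frac{\partial}{\partial \pi_r}\right)$, $\ldots$,

where $\left(\rho_r \frac{\partial}{\partial \pi_r}\right)$ consists of $\binom{n}{r}$ terms

$\rho_{i_1 i_2 \ldots i_r} \dfrac{\partial}{\partial \pi_{i_1 i_2 \ldots i_r}}$,

the $r$ suffixes taking values $1, 2, \ldots, n$, and no two in one set
being equal.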


Example.

If $a_x^2$, $b_x^2$ are two quadratics, and $x, y, z, t$ denote four points, then the
concomitant $(ab|xy)(ab|zt)$
may be regarded as a polar of $(ab|xy)^2$; namely, it is

$\tfrac12\left(\rho_2 \frac{\partial}{\partial \pi_2}\right)(ab|\pi_2)^2$,

where $\pi_2 = xy$ and $\rho_2 = zt$. In this example the polar consists of a single
term. But more generally the polar of

$(ab|xy)(cd|xy)$

is $(ab|xy)(cd|zt) + (ab|zt)(cd|xy)$,

giving an example of a series of two terms derived by polarization.


Manifestly repeated operation with $\left(\rho_i \frac{\partial}{\partial \pi_i}\right)$ on
a form $F$ of given order in the variable $\pi_i$ produces in general a series
of considerable complexity. Still more so if this can simultaneously
go on for values of $i = 1, 2, \ldots, n - 1$. Our immediate aim is
to express any single term of such a polar as an aggregate of
polars of certain standard forms $F$ together with the absolute
concomitant which we shall denote by either $(xy \ldots t)$ or $E$. Although
at first sight this looks impossible, it can in fact be done, and is
indeed important for the following reasons:


(1) Polarization is an invariant process (§8, p. 207, example). 


(2) Any single term of a symbolic expression of any concomitant,
whatever variables $x, y, \ldots, u, v, \ldots$ may be involved,
is always a term of a polarized standard form $F$.


This last follows from the Fundamental Theorem. For if
each $u, v, \ldots$ which appears is treated as an $(n - 1)$th compound
of the $x$’s, then each symbolic factor of a term $P$ either is
free from variables or is an explicit linear function of an $r$th
compound ($r = 1, 2, \ldots, n - 1$), say $\rho_r$. As such it is a polar
derived from $\pi_r$ by the operator $\left(\rho_r \frac{\partial}{\partial \pi_r}\right)$. Hence $P$ is certainly
a term of a polar of a standard form.

Example , — 

(ah I zVf (a^yy) (cd | xy) is a term of 

(P2g^-)* {y kz)® (aPr^) (cd| TC 2 ). 

3. Reduction to Standard Form. 

The reduction of any form $f$ to standard form depends upon
two main ideas, one being the use of the Sylvester fundamental
identity (§13, p. 93), and the other the theory of adjacent
terms in a permanent (§1, p. 14).

We take the most general symbolic form as in §2, p. 198, involving
cogredient symbols, and first consider a standard $p$-ic
in one variable $x$,

$F = a_{1x} a_{2x} \ldots a_{px}$. (5)

If we polarize this with regard to $p$ cogredient variables $x_1, x_2, \ldots, x_p$
we obtain

$\Delta F = S = \Sigma\, a_{1x_i} a_{2x_j} \ldots a_{px_k}$, (6)

summed for the $p!$ permutations of the $p$ suffixes $i, j, \ldots, k$. Here
on the right is an example of a permanent (§1, p. 14), all the
signs being positive. Let us call two of its terms adjacent if
they differ by adjacent interchange of suffixes (§1, p. 14). Then
any two terms $T$ and $T'$ can be connected by a series of adjacent
terms $T_1, T_2, \ldots$, all belonging to $S$.
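The two facts used here, that the polarized form is a permanent and that any two of its terms are joined by a chain of adjacent terms, can be illustrated in code. The sketch below is an illustration only: it assumes that "adjacent" means an interchange of two neighbouring suffixes, and the function names are hypothetical.

```python
from itertools import permutations


def permanent(m):
    """Sum over all permutations s of the products m[0][s[0]] * m[1][s[1]] * ..., all signs positive."""
    n = len(m)
    total = 0
    for s in permutations(range(n)):
        product = 1
        for row in range(n):
            product *= m[row][s[row]]
        total += product
    return total


def adjacent_chain(start, target):
    """List of suffix arrangements from start to target.

    Each arrangement differs from the previous one by interchanging
    two neighbouring suffixes (a bubble-sort style walk).
    """
    current = list(start)
    chain = [tuple(current)]
    for position, wanted in enumerate(target):
        k = current.index(wanted, position)
        while k > position:
            current[k - 1], current[k] = current[k], current[k - 1]
            chain.append(tuple(current))
            k -= 1
    return chain


# Entry m[k][i] plays the part of the factor a_k evaluated at the variable x_i.
m = [[1, 2], [3, 4]]
assert permanent(m) == 1 * 4 + 2 * 3

chain = adjacent_chain((1, 2, 3), (3, 2, 1))
assert chain[0] == (1, 2, 3) and chain[-1] == (3, 2, 1)
```

Each step of such a chain corresponds to one application of the Sylvester identity to a pair of adjacent terms.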

But the difference between two adjacent terms $T_\alpha$, $T_{\alpha+1}$
leads to a Sylvester identity. In fact, if $i$ and $j$ are the two
interchanged suffixes of the terms, we have

$a_{qx_i}a_{rx_j} - a_{qx_j}a_{rx_i} = (a_qa_r|x_ix_j)$, (7)

and the other $(p - 2)$ factors of the terms are common. Thus

$T_\alpha - T_{\alpha+1} = (a_qa_r|x_ix_j)\,\Pi a_x$, (8)

where $\Pi$ denotes the other $(p - 2)$ unaffected factors. Hence by
continued application to adjacent terms

$T - T' = (T - T_1) + (T_1 - T_2) + \ldots + (T_\lambda - T') = \Sigma (a_qa_r|x_ix_j)\,\Pi a_x$, (9)

and by taking $T'$ to be each term of the series (6) in turn and
adding, we have

$p!\,T - \Delta F = p!\,T - \Sigma T' = \Sigma\Sigma (a_qa_r|x_ix_j)\,\Pi a_x$. (10)

This shows that, but for a non-zero numerical factor $p!$, any
term $T$ of $\Delta F$ is equivalent to the polar $\Delta F$ itself together with
terms like those in the right member of (9) derived by convolution
from $\Delta F$.

To simplify the notation let us provisionally write $\{x_i\}$ for
any factor $a_{x_i}$, $\{x_ix_j\}$ for any factor $(a_qa_r|x_ix_j)$, whatever $a_q$, $a_r$
may be, and so on for $\{x_ix_jx_k\}$, &c. We can now consider a
standard form in two variables $x, y$ of type

$F = \Pi (a_qa_r|xy)\,\Pi a_x = \{xy\}\{xy\} \ldots \{x\}\{x\} \ldots$, (11)

where there are a certain number of factors $\{xy\}$ and of factors $\{x\}$.

Any polar $\Delta F$ of this with regard to variables $\rho_2, \sigma_2, \ldots$
leads to terms such that the difference between adjacent
terms gives a Sylvester identity, say

$(a_qa_r|\rho_2)(a_sa_t|\sigma_2) - (a_qa_r|\sigma_2)(a_sa_t|\rho_2)$

$= \Sigma (\ldots|x_ix_jx_k)(\ldots|x_l) + \Sigma (\ldots|x_ix_jx_kx_l)$, (12)

where $\rho_2 = x_ix_j$, $\sigma_2 = x_kx_l$.

Hence, arguing as before, we deduce that any term of type 

}{***«} ••• (13) 


is equivalent to a polar of a standard form (11), together with
forms with more than two $x$’s convolved in one factor. These
last may introduce a factor $\{x_l\}$ as in the first term on the right
of (12); and this, along with the factors $\{x\}\{x\} \ldots$ of (11), can
be dealt with as a polar with regard to $\left(x_l \frac{\partial}{\partial x}\right)$, leading, as in
(10), to a new factor $\{x_lx\}$, for which the argument may be
repeated.

Combining these results we gather that any term

$\{x_ix_j\}\{x_kx_l\} \ldots$ (14)

is equivalent to a polar of (11), together with forms involving
three or more $x$’s convolved.
Proceeding in this way and using the Sylvester identity for
each further case in turn, with $r = 3, 4, \ldots$, we arrive finally
when $r = n - 1$ at the case where $n$ variables $x, y, \ldots, s, t$
and standard form $F$ are given, such that

$F = \Pi (xy \ldots st)\, \Pi \{xy \ldots s\} \ldots \Pi \{xy\}\, \Pi \{x\}$, (15)

and any form $f$ involving any number of variables $x_i$, $\rho_2$, $\sigma_2$,
$\ldots$, $\rho_{n-1}$, $\sigma_{n-1}$ is expressible as a series of terms $\Delta F$ derived
from such forms $F$ by polarization.

Here, the first factor $\Pi$ gives the absolute concomitant of
the field and all the other factors involve at most $(n - 1)$ variables
$x, y, \ldots, s$. This establishes the theorem of Clebsch.





In this final formula (15) each factor of the products has all
its variables $x, y, \ldots$ explicitly stated. The other symbols are
implicit. They are the symbols $a_1, a_2, \ldots$ of the original form
$f$ in some order or other. Inasmuch as the process of this reduction
of $f$ to $\Sigma \Delta F$ is entirely composed of repetitions of the Sylvester
identity, which preserves the symbols but only deranges
them, it follows that any symbols convolved in the original form
$f$ are still convolved, implicitly or explicitly, in the standard
form $F$.


Corollary. By taking the dual co-ordinates $p_i = \pi_{n-i}$ we can
throw any standard form into a product of bracket factors

$F = \Pi (uv \ldots wu')\, \Pi (ab \ldots u)\, \Pi (a'b' \ldots uv) \ldots \Pi (g''uv \ldots w)$. (16)

There are in fact four ways of writing each factor of $F$. Thus

$(ab \ldots d\,|\,xy \ldots z) = (ab \ldots d\,|\,\pi_{n-i}) = (ab \ldots d\,p_i) = (ab \ldots d\,uv \ldots)$. (17)

In practice the process of deriving the standard forms for a
given expression $f(x, y, \ldots)$ is exceedingly complicated except
in the simplest cases. The present treatment follows the algebraic
method as used for binary forms.¹ The usual treatment follows
the methods of Capelli who bases all on differential operations
rather than algebraic permutations.

4. The Gordan-Capelli Series. 

Let us apply the preceding methods to the form

$f = a_x^p b_y^q c_z^r \ldots d_t^s$, (18)

where there are $k \ge 2$ cogredient sets $x, y, z, \ldots$. First suppose
$k < n$. Then such a form $f$ is a term of a polar of

$a_x^p b_x^q c_x^r \ldots d_x^s$,

and, treated as above, $f$ is equal to a sum of terms where the
most advanced convolution of the variables is

$\{xyz \ldots t\} = (abc \ldots d\,|\,xyz \ldots t) = K$, (19)

for all the $k$ letters before or after the vertical line must differ,
so that there is just this one possibility.


^ Grace and Young, Algebra of Invariants (1903), 42-46. 




Terms which do not contain $K$ are due to polars of standard
forms $F_0$ involving $k - 1$ variables, say all but $x$. So we write

$f = \Sigma \Delta_0 F_0 + K\phi$, (20)

$\Delta_0$ denoting an aggregate of polar operations, and $\phi$ necessarily
containing $x$ to degree $p - 1$, $y$ to degree $q - 1$, and so on.

Treating $\phi$ in the same way as $f$ we have

$\phi = \Sigma \Delta_1 F_1 + K\psi$,

where $\Delta_1 F_1$ has the same general meaning as $\Delta_0 F_0$, and $\psi$ is of
degrees $p - 2$, $q - 2$, $\ldots$, in the variables.

Proceeding in this way we exhaust one of the variables in
$h$ steps where $h$ is the least of the exponents $p, q, \ldots, s$; and
thereby we obtain the Gordan-Capelli Series,

$f = \Sigma \Delta_0 F_0 + K \Sigma \Delta_1 F_1 + K^2 \Sigma \Delta_2 F_2 + \ldots + K^h \Delta_h F_h$. (21)

Here each $F_i$ is a form involving at most $(k - 1)$ different sets,
$K$ is the $k$th compound inner product $(ab \ldots d\,|\,xy \ldots t)$,
and $\Delta_i$ is a polar operation. Some of the coefficients of powers
of $K$ may in particular cases be zero.

Secondly, if $k = n$, the expression $K$ is replaced by an actual
product $(ab \ldots d)(xy \ldots t)$
involving the absolute invariant $(xy \ldots t) = E$ of the field.
In this case the Gordan-Capelli series is

$f = \Sigma \Delta_0 F_0 + (xy \ldots t)\, \Sigma \Delta_1 F_1 + (xy \ldots t)^2\, \Sigma \Delta_2 F_2 + \ldots + (xy \ldots t)^h \Delta_h F_h$. (22)

Here the coefficients of the series are polars of forms $F_i$ involving
at most $(n - 1)$ sets $y, z, \ldots, t$.

Thirdly, if $k > n$ no corresponding form $K$ exists, so that we
have

$f = \Sigma \Delta_0 F_0$, (23)

expressing $f$ as a series of polars of forms involving at most
$(n - 1)$ sets $y, z, \ldots, t$.

Various alternative expressions of $K$ are furnished by (17).
In particular, if $k = n - 1$, we write $u$ instead of $\pi_{n-1}$, where
the set $u$ denotes a prime, such that

$u_1 = +(xy \ldots t)_{23 \ldots n}$, &c.,

and the series now takes the form

$f = \Sigma \Delta_0 F_0 + (ab \ldots du)\, \Sigma \Delta_1 F_1 + (ab \ldots du)^2\, \Sigma \Delta_2 F_2 + \ldots$


5. Examples of the Series for Binary and Ternary Fields. 

In the binary field for a form

$f = a_x^m b_y^n$ ($n \le m$) (24)

with two sets $x$ and $y$ the series was originally given by Clebsch and Gordan
as


/=S 




(I) ® 


(^)* 

m -f w — A; + 1^ (n — k)\ 


1 (25) 


where $D_{xy}$ denotes the polar operator $\left(y \frac{\partial}{\partial x}\right)$.

In this case the coefficient of $(xy)^k$ is the $(n - k)$th polar of a form

$J_k = (ab)^k a_x^{m-k} b_x^{n-k}$

depending on only one set of variables.


In the ternary field where now ax — the corre- 

sponding series for / — a.c"‘ hy^*' {n i^m) is 


/=S 

^=0 


/m + w — A: -f 1\ 
V k ) 


(abv)^ 
(n — A;) ! 




where = {xy)^^, &c. And more generally the series for 
is /= SAoi'o+ {^2/2)2 Aii’i-f (a:y 2 ) 2 LA 2 J^ 2 H- • • • • 


6. Normal Forms.¹

Ground forms which can be symbolized as

$(abc|xyz)^i (ab|xy)^j (a|x)^k$, $i, j, k > 0$, (26)

with symbols as well as variables appearing in the characteristic
standard order are called normal forms. In this example we
assume $n > 3$, to prevent the first factor from reducing.

It is obvious that for a normal form any invariant linear
in its coefficients must vanish. For every outer product $(abc \ldots)$
must contain a repeated symbol.

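Again, by polarization with $\left(x \frac{\partial}{\partial y}\right)$, $\left(y \frac{\partial}{\partial z}\right)$, $\ldots$ we
manifestly obtain a zero result. This leads to an important
theorem:

If, when $n > 4$, the form $f = a_x^p b_y^q c_z^r d_t^s$, $p \ge q \ge r \ge s$,
satisfies the three differential equations

$D_{yx}f = D_{zy}f = D_{tz}f = 0$, (27)

it can be written in the alternative normal form

$C\,(abcd|xyzt)^s (abc|xyz)^{r-s} (ab|xy)^{q-r} (a|x)^{p-q}$,

where $C$ is a numerical constant, and $D_{yx} = \left(x \frac{\partial}{\partial y}\right)$, &c.

¹ This section may be omitted on a first reading.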

Proof , — 

The argument will hold for any number of variables. Consider 
the matrix of polar operations 

h Vt 

Vz ^z 

Vy 

-tx ^x Vx 

where the typical element denotes a polar operation, e.g. 

Since jSa = jS^ya — yaPy holds for any three a, jS, y among 
X, y, z, t, we infer that all elements above and to the right 
of the diagonal z^, y^, x,^ are expressible in terms of these. 
Hence, for reasons used in (23), p. 116, the equations (27), which 
we can call the diagonal equations, imply a whole triangle of 
equations. This is true in general for h variables x, y, , s, t. 

Again, if H is the Capelli operator (25), p. 117, answering to 
the matrix D, its expansion as a determinant consequently loses 
all terms except the leading diagonal 

tt(z^+l) {yy+2)(x,+ ^) 

when it operates on /, if all these elements of the upper, triangle 
are zero. 

But by Euler’s theorem for homogeneous functions

$x_xf = pf$, $y_yf = qf$, $z_zf = rf$, $t_tf = sf$.

Hence $Hf = (p + 3)(q + 2)(r + 1)\,s\,f$.






Once more, writing H = 'E(xyzt)ij^i(^ 1) and 

operating directly on /, we obtain vu ijki 

pqrs{abcd | xyzl)a^~‘'^hy^'^'^ 

Accordingly 

{V + 3) (g + 2) (r + 1) «/= JT/^, . . (29) 

where 

K = {abed I xya), 

Let the polar operator $\beta_\alpha$ ($\alpha \ne \beta$) denote any element of
(28), not on the leading diagonal. Then by actual differentiation
we find,

iSa Kf, (i3a K)f, + KpJ, KMi. 

so that K commutes with the operator jSa, when acting on f-^. 
Hence KPo^fi= 0, whenever and therefore, by (29), j3a/ 

vanishes. So that Kfy^ satisfies exactly the same relations (27) 
as /, provided that, for the purpose of differentiation, K is re- 
garded as a constant. 

Then. if s > 1, we deduce, similarly to (29), that Kf^^Kc^Kf^, 
and therefore 

/= f, - dr\ 

where Cj, C are numerical non-zero constants. 


^ followed 


Similarly s such operations lead to 
/— Ci(abcd I xyzt) 

where Ci is numerical, and non-zero. 

( I 0 0 0 

xyz • - - ~ - ' 

0 0\ \o X V y 0 Z‘ 

by (q — r) with ixy ) we obtain the desired normal form, 

^ dx dy' 

so proving the theorem. 

EXAMPLES 

1. The necessary and sufficient conditions for $f$ to be symbolized by
a perfect $p$th power

$f = (abc \ldots e\,|\,xy \ldots t)^p$

are that it satisfies $2(k - 1)$ conditions

$D_{xy}f = D_{yz}f = \ldots = D_{st}f = 0$, $D_{yx}f = \ldots = D_{ts}f = 0$

among the $k$ variables $x, y, \ldots, s, t$.

2. What are the requisite conditions for a function of $n$ sets of variables
$x, \ldots, t$ to be a perfect $p$th power of the determinant $(xy \ldots t)$?

[Put $k = n$ in Ex. 1.

3. The necessary and sufficient conditions for a symbolic form in two
sets $x, y$ to be a perfect $p$th power $(ab|xy)^p$ are that it satisfies two differential
equations

$\left(y \frac{\partial}{\partial x}\right)f = 0$, $\left(x \frac{\partial}{\partial y}\right)f = 0$.

4. A quaternary line complex is a form in six variables, Pu> Pia* 
i^ 23 » P 34 » Pa 2 » where pij— (xy)ij» Prove that it can always be symbolized in 
the normal form {abpY^ = (ab | ocyYK 

[Use the differential equations of Ex. 3 on the non-symbolic form. 


7. Historical Note. 

The results given in this chapter cover a long period of study.
The Gordan-Capelli binary series was first given by Clebsch and
Gordan,¹ and next it was extended to the general case by Clebsch
and Capelli.²

These normal forms of §6 were called primary covariants by
Deruyts (Essai . . . (1891)), who also studied this general problem,
although the theory goes back to Clebsch,³ Gordan, Mertens,⁴
and Study.⁵ The general theory is given by E. Noether⁶ who
uses the theorem of corresponding matrices, and by Weitzenböck⁷
who introduces complex symbols. A purely algebraic discussion
free from differential operators can be based upon the far-reaching
results of Frobenius⁸ and A. Young.⁹

¹ Math. Annalen, 5 (1872), 96-122.

² For ternary forms, Battaglini, 18 (1880). For n-ary forms, Mem. del. It.
Acc. dei Lincei (1882), (1891), (1892), and Math. Annalen, 217 (1886). See above,
p. 254 (21).

³ Göttinger Nachrichten, 17 (1872). Ternary and general.

⁴ Wiener Berichte, 98 (1899). Quaternary.

⁵ Methoden, p. 64. Ternary.

⁶ Math. Annalen, 77 (1916), 93; Crelle, 139 (1910), 118 seqq.

⁷ Invariantentheorie (1923), V, pp. 121-169.

⁸ Berliner Sitzungsberichte, 1 (1897); 2 (1899).

⁹ Algebra of Invariants, Chapter XVI; Proc. London Math. Soc., 33 (1901)
and 34 (1903), 228 (1928).



CHAPTER XVII 


Applications of Clebsch’s Theorem. Apolarity and
Canonical Forms


1. Similar Forms. 

When a number of forms 

/i./a. •••./v. (1) 


all have the same sets of variables and are all of the same
respective orders $[p, q, \ldots]$ in these variables, they are called
similar forms. For example, we may have a system of ternary
quadratics, in which case $n = 3$, $p = 2$, and one set $x$ of variables
is used.

Let $N$ be the number of the coefficients

$A_1, A_2, \ldots, A_N$ (2)

in $f_1$ and therefore in each similar form. For ternary quadratics
this $N$ would be six. Further let $A, B, \ldots, G, H$ denote these
coefficient sets of $N + 1$ similar forms $f_1, f_2, \ldots, f_N, f_{N+1}$.
From these we can build a vanishing determinant

$\begin{vmatrix} A_1 & A_2 & \ldots & A_N & f_1 \\ B_1 & B_2 & \ldots & B_N & f_2 \\ \ldots & & & & \ldots \\ G_1 & G_2 & \ldots & G_N & f_N \\ H_1 & H_2 & \ldots & H_N & f_{N+1} \end{vmatrix} = 0$, (3)

because the last column is a linear function of the other $N$ columns.
Let the expansion of this by its last column be

$K_1f_1 + K_2f_2 + \ldots + K_{N+1}f_{N+1} = 0$. (4)

This shows that, as in §1, p. 73,

Any $N + 1$ similar forms are linearly dependent.
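The statement can be made concrete for ternary quadratics, where $N = 6$. The sketch below is an illustration only, with hypothetical function names: it counts the coefficients of a $p$-ic in $n$ variables as $\binom{n+p-1}{p}$ and finds multipliers $K_1, \ldots, K_{N+1}$, not all zero, by exact elimination on the coefficient sets. The multipliers it returns need not be the cofactors of (3), only proportional to them when the dependence is unique.

```python
from fractions import Fraction
from math import comb


def coefficient_count(n, p):
    """Number N of coefficients of a form of order p in n variables."""
    return comb(n + p - 1, p)


def dependency(rows):
    """Return K, not all zero, with sum_i K[i] * rows[i] == 0.

    Assumes there are more rows (forms) than columns (coefficients per form).
    """
    m, ncols = len(rows), len(rows[0])
    # Append an identity block to record which combination of rows each line holds.
    work = [
        [Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(m)]
        for i, row in enumerate(rows)
    ]
    pivot_row = 0
    for col in range(ncols):
        found = next((r for r in range(pivot_row, m) if work[r][col] != 0), None)
        if found is None:
            continue
        work[pivot_row], work[found] = work[found], work[pivot_row]
        for r in range(pivot_row + 1, m):
            if work[r][col] != 0:
                factor = work[r][col] / work[pivot_row][col]
                work[r] = [u - factor * v for u, v in zip(work[r], work[pivot_row])]
        pivot_row += 1
    # Every line from pivot_row onward now has a zero coefficient part.
    return work[pivot_row][ncols:]


# Seven ternary quadratics, each given by its six coefficients.
N = coefficient_count(3, 2)
forms = [[(i + 1) ** j for j in range(N)] for i in range(N + 1)]
K = dependency(forms)
assert any(k != 0 for k in K)
assert all(sum(K[i] * forms[i][c] for i in range(N + 1)) == 0 for c in range(N))
```

The same count gives ten coefficients for a quaternary quadratic, so that any eleven quadrics are linearly dependent.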


If the rank $r$ of the matrix of the first $N$ columns is less than
$N$, then the forms are not the most general, but can be expressed
in terms of $r$ forms suitably chosen. If $r = 2$ each form can
be written as

$\lambda f + \lambda' f'$

in terms of two linearly independent similar forms. Then they
are said to make a pencil of forms. For $r = 3$ they make a net

$\lambda f + \lambda' f' + \lambda'' f''$.

The fundamental property of similar forms is this:

Each set $A, B, \ldots$ behaves like a cogredient linear form in the
field of category $N$.

For if $A \to A'$, $B \to B'$, $\ldots$ denote the transformations
induced by that of the variables $x \to x'$, then the coefficient
matrices of these transformations are all precisely the same
because the sets $A, B, \ldots$ are similar. It follows that the
co-factor of $f_{N+1}$ in (3), say

$K_{N+1} = |A_1B_2 \ldots G_N|$,

is an invariant, since it is a typical bracket factor for the field
of category $N$. So also are each of $K_1, K_2, \ldots$.

2. Types. 

We already know that the Aronhold operator (§10, p. 141)

$\left(B \frac{\partial}{\partial A}\right) = B_1 \frac{\partial}{\partial A_1} + B_2 \frac{\partial}{\partial A_2} + \ldots + B_N \frac{\partial}{\partial A_N}$

produces a concomitant when it operates on a concomitant
involving $A$. Indeed the process is analogous to the polar process
involving variables, and thereby it leads to a theorem, first given
by Peano, which will be seen to play for the coefficients exactly
the same part that the theorem of Clebsch played for the variables.

Let all such Aronhold operators involving similar sets of
coefficients $A, B, C, \ldots$ be utilized, and a rational integral
combination of these acting on a concomitant be called an
Aronhold process. Then every concomitant so derivable from
one and the same original concomitant is said to be of the same
type.





EXAMPLES 


1 . Any concomitant can be rendered multilinear (§10, p. 141) in its 
ground form coefficients, by Aronhold processes. All concomitants of 
the same type can be brought to the same multilinear form. 


2. An Aronhold operator Eo;aj . . . acting on a form linear in 

the coefficients . . automatically replaces the actual by the symbolic 
coefficients. One such operator for each coefficient set reduces the form 
entirely to symbols. 


3. Ternary cubics $a_x^3$, $b_x^3$, $c_x^3$ have an invariant of type $(abc)^3$.

For each of $(abd)^3$, $(bcd)^3$, $\ldots$ can be derived by an Aronhold process.


3. Peano’s Theorem. 

It is obvious from (4) that if one of $K_i$ is non-zero, say $K_{N+1}$,
then any form $f_{N+1}$ can be expressed in terms of the first $N$ similar
forms. Each coefficient of $f_{N+1}$ is given as a rational function of
those of $f_1, \ldots, f_N$, with $K_{N+1}$ appearing in the denominator.
If, further, the irreducible system, given by Gordan's theorem,
is known for the $N$ forms, a rationally complete system for any
extra simultaneous similar forms naturally follows. But we can
go further and find an integrally complete system in general,
once we know it for $N - 1$ similar forms: and this brings us
to the theorem.¹

Peano’s Theorem. With the possible exception of the determinant
$K$, linear in the coefficients of $N$ similar forms each with
$N$ coefficients, every polynomial concomitant of any number of such
forms is expressible by the complete system of $N - 1$ such forms,
and by types derived from this system.

Proof , — 

We regard the concomitant as a polynomial, homogeneous
in each set of $N$ coefficients $A$. Selecting the coefficient sets
of the first $N$ of $N + i$ such forms, we express every polynomial
concomitant as a Gordan-Capelli series (p. 254 (21))

$\Delta_0\phi_0 + K\Delta_1\phi_1 + K^2\Delta_2\phi_2 + \ldots$,

where $K$ is the determinant $K_{N+1}$ of these $N$ forms, $\phi_i$ is a function
of at most $N - 1$ sets of coefficients, and $\Delta_i$ is an Aronhold
process.

By the mode of constructing such a series, each $\phi_i$ is invariantive
since we started with an actual concomitant. Thus the
series is entirely composed of types belonging to $N - 1$ ground
forms at most, together with $K$, involving $N$ ground forms
linearly. Q.E.D.

¹ Atti di Torino, 17 (1881), p. 580; D. Hilbert, Schwarz-Festschrift (1914);
E. Noether, Math. Annalen, 77 (1916), p. 93; for binary forms, see Grace and
Young, pp. 321, 349, 358; Weitzenböck, Invariantentheorie, p. 162.

In some cases K itself is reducible, as in the binary forms of 
odd order.^ In some it is certainly irreducible as in the case of 
six conics. 

Example , — 

In the quaternary field, a complete study of concomitants is effected
by confining oneself to ground forms in three types of variable $x = uvw$,
$p = uv$, and $u$, together with polar forms, while the knowledge of all
possible types of concomitant for a given type of ground form, say a
quadratic in $x$ which has ten terms $\Sigma a_{ij}x_ix_j$, is complete if we know those
of nine quadrics together with the ten-rowed determinant linear in the
coefficients of ten quadrics.


4. Dual Similar Forms. 

Just as there are dual systems of variables

$x, y, \ldots, \pi_2, \ldots, \pi_3, \ldots, \pi_{n-1}, \ldots$
and $u, v, \ldots, p_2, \ldots, p_3, \ldots, p_{n-1}, \ldots$ (5)

so we may consider certain forms $f$ and $\phi$ to be dual of each
other. Symbolically we merely have to interchange italic and
Greek letters $a, b, c$, $\alpha, \beta, \gamma, \ldots$ throughout.

For example, $a_x^p$, $u_\alpha^p$ are dual forms of order $p$ in the
original variables; $(ab|xy)^p$, $(\alpha\beta|uv)^p$ are dual forms in second
compound variables. More generally

$f = a_xa'_y \ldots (bc|\pi_2) \ldots (def|\pi_3) \ldots$
$\phi = u_\alpha v_{\alpha'} \ldots (\beta\gamma|p_2) \ldots (\delta\epsilon\zeta|p_3) \ldots$ (6)

are called similar dual forms when they possess corresponding
symbolic factors. Manifestly they have the same orders in their
corresponding variables (5), and the same number $N$ of actual
coefficient sets, say

$A = \{A_1, A_2, \ldots, A_N\}$
$P = \{P_1, P_2, \ldots, P_N\}$


^ Grace and Young, loc, cit. 




Obviously the preceding results of this chapter apply to a system
of similar forms $\phi_1, \phi_2, \ldots$.

In general the sets $P$ and $A$ are arbitrary and independent,
for this duality merely refers to their structure.

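Dual similar forms $f = a_x^p$, $\phi = u_\alpha^p$ possess a simultaneous
invariant linear in each, namely

$(f, \phi) = c_1A_1P_1 + c_2A_2P_2 + \ldots + c_NA_NP_N$, (7)

where each $c_i$ is the multinomial coefficient occurring in the $i$th
term of both $f$ and $\phi$.

This invariant can be generated non-symbolically by the
operator

$\Omega = \dfrac{\partial^2}{\partial x_1 \partial u_1} + \dfrac{\partial^2}{\partial x_2 \partial u_2} + \ldots + \dfrac{\partial^2}{\partial x_n \partial u_n}$.

In fact

$\Omega\, a_x^p u_\alpha^p = p^2 a_\alpha a_x^{p-1} u_\alpha^{p-1}$.

Hence $\Omega^p f\phi = p!\,p!\,a_\alpha^p = p!\,p!\,(f, \phi)$.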
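The invariance of the lineo-linear invariant (7) may be checked numerically for ternary quadratics. The sketch below is an illustration only and rests on assumptions not stated in the text: the point quadratic and the dual quadratic are held as symmetric matrices $S$ and $T$, the variables are transformed by $x = Mx'$ with the contragredient change for $u$ (so that $u_x$ is unaltered), and the matrix $M$ and all function names are hypothetical choices.

```python
def matmul(P, Q):
    """Product of two matrices given as lists of rows."""
    return [
        [sum(P[i][k] * Q[k][j] for k in range(len(Q))) for j in range(len(Q[0]))]
        for i in range(len(P))
    ]


def transpose(P):
    return [list(row) for row in zip(*P)]


def bilinear_invariant(S, T):
    """Sum of S[i][j] * T[i][j]; for symmetric S, T this is aA + bB + cC + 2fF + 2gG + 2hH."""
    return sum(S[i][j] * T[i][j] for i in range(3) for j in range(3))


# An assumed transformation of modulus 1, with its inverse written out by hand.
M = [[1, 2, 0], [0, 1, 3], [0, 0, 1]]
M_INV = [[1, -2, 6], [0, 1, -3], [0, 0, 1]]
assert matmul(M, M_INV) == [[1, 0, 0], [0, 1, 0], [0, 0, 1]]


def transform_pair(S, T):
    """x = M x' gives S' = M^T S M; the contragredient u gives T' = M^-1 T (M^-1)^T."""
    S_new = matmul(matmul(transpose(M), S), M)
    T_new = matmul(matmul(M_INV, T), transpose(M_INV))
    return S_new, T_new


# S holds (a, b, c; f, g, h) as [[a, h, g], [h, b, f], [g, f, c]]; T likewise for the dual form.
S = [[1, 2, 3], [2, 4, 5], [3, 5, 6]]
T = [[2, 0, 1], [0, 3, -1], [1, -1, 4]]
S_new, T_new = transform_pair(S, T)
assert bilinear_invariant(S_new, T_new) == bilinear_invariant(S, T)
```

In matrix language the invariant is the trace of the product $ST$, which explains why it is unaltered by the paired transformations.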


EXAMPLES 

1. The ternary quadratics $ax^2 + by^2 + cz^2 + 2fyz + 2gzx + 2hxy$ and
$Au^2 + Bv^2 + Cw^2 + 2Fvw + 2Gwu + 2Huv$ have a bilinear invariant

$aA + bB + cC + 2fF + 2gG + 2hH$.

2. Adapt this theorem to binary forms.

If $f = a_x^p$ is a binary $p$-ic $(a_0, \ldots, a_p)(x_1, x_2)^p$, a dual form is also
a binary $p$-ic but its coefficients are reversed with alternate signs
changed. Thus

$a_0, a_1, a_2, a_3$ and $b_3, -b_2, b_1, -b_0$

are dual sets for binary cubics. For by (49), p. 145, contragredient binary
variables $u_1, u_2$ are cogredient with $x_2, -x_1$, so that the dual form is a
polarized form of, say, $b_x^p$, the polar operator being

3. For binary forms $a_x^p$, $b_x^p$ this invariant is their $p$th transvectant

$(ab)^p = a_0b_p - p\,a_1b_{p-1} + \binom{p}{2} a_2b_{p-2} - \ldots + (-1)^p a_pb_0$.

4. Two general similar forms $a_xb_y \ldots$, $u_\alpha v_\beta \ldots$ have an
invariant $a_\alpha b_\beta \ldots$ derived by a product of operators $\Omega$.





5 . More generally, two dual similar forms involving compounds TUt, pi 

have such an invariant, derived from a product of (operators | . 

6. Derive aa(&c| py)* from (6c 1 7T2)®, WaCPyl ^2)** and write down a 
typical term in non-symbolic notation. 

5. Apolarity. 

Definition. Two dual similar forms $f$, $\phi$ are apolar if their
lineo-linear invariant $(f, \phi)$ vanishes.

When two forms $f$, $\phi$ have this property very many interesting
geometrical facts can be deduced. We shall confine ourselves
to one aspect of the case, namely to the discovery of the so-called
canonical forms. But as a preliminary to this, a few properties
of apolar forms are useful.

(i) First, there is one dual form apolar to each of $N - 1$
given linearly independent similar forms $f_1, f_2, \ldots, f_{N-1}$; for
this amounts to giving $N - 1$ linear equations

$c_1A_1P_1 + c_2A_2P_2 + \ldots + c_NA_NP_N = 0$, (8)

one for each of the $N - 1$ sets of coefficients $A, B, \ldots, F$.
The $c$’s are the same throughout, and the equations determine
the set $P$, and therefore the form $\phi$, to a constant factor.

(ii) Next, if $r$ of the coefficients $A$ happen to vanish and the
complementary $(N - r)$ coefficients of $\phi$ vanish, then (8) is
satisfied, so that $f$ and $\phi$ are apolar.

(iii) Again, if $\phi$ is apolar to each of $f_1, f_2, \ldots$ it is also apolar
to any linear combination of $f_1, f_2, \ldots$. Further a linear combination
of $r$ forms $f$ can be apolar to such a combination of $N - r$
forms $\phi$. These results all follow from the condition (8).
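For binary forms the properties (i) to (iii) can be tried out directly, since the lineo-linear invariant is then the $p$th transvectant of two $p$-ics. The following sketch is an illustration only: it assumes each form is given by its coefficients $a_0, \ldots, a_p$ with binomial multipliers understood, and the function name is hypothetical.

```python
from math import comb


def apolar_invariant(a, b):
    """The transvectant (ab)^p = sum_k (-1)^k C(p, k) a_k b_{p-k} of two binary p-ics.

    Each form is passed as (a_0, ..., a_p), standing for sum_k C(p, k) a_k x1^(p-k) x2^k.
    """
    p = len(a) - 1
    return sum((-1) ** k * comb(p, k) * a[k] * b[p - k] for k in range(p + 1))


# x1^2 + x2^2 and x1^2 - x2^2 are apolar.
assert apolar_invariant((1, 0, 1), (1, 0, -1)) == 0

# Property (i), with N = 3: the form 2 x1 x2 is apolar to both x1^2 and x2^2.
assert apolar_invariant((1, 0, 0), (0, 1, 0)) == 0
assert apolar_invariant((0, 0, 1), (0, 1, 0)) == 0

# Property (iii): it is then apolar to every combination l x1^2 + m x2^2.
for l, m in [(2, 5), (-1, 3), (7, 0)]:
    assert apolar_invariant((l, 0, m), (0, 1, 0)) == 0
```

The second pair of checks is also an instance of property (ii): the forms $x_1^2$ and $x_2^2$ lack the middle coefficient, while $2x_1x_2$ has only the middle coefficient.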

6. Apolarity of Dissimilar Forms. 

When forms $f$, $f'$ possess the same variable sets but to different
orders, they may be reduced to a common order by polarization.
For simplicity let us deal with one variable set $x$ only. Then if
$f = a_x^p$, $f' = b_x^q$, $q > p$, we polarize the latter $(q - p)$ times with
an arbitrary cogredient set $y$ and obtain the form $f'' = b_x^p b_y^{q-p}$
similar to $f$ as regards $x$.

Let $M$ be the number of separate terms in the non-symbolic
expression of $f''$ as a polynomial in the arbitrary set $y$. Then $f''$
is in effect a set of $M$ linearly independent $p$-ics in $x$. Now
if $N > M$ we can find $N - M$ forms $\phi$ apolar to each such
portion of $f''$. We therefore take this as the definition of apolarity
of $f'$ and $\phi$ when the orders differ. Thus, in symbolic notation,
we infer that

Two forms $f' = b_x^q$, $\phi = u_\alpha^p$ $(q > p)$ are apolar if the covariant
$b_\alpha^p b_y^{q-p}$ vanishes identically for all values of $y$.

Dually, if $q < p$, $f'$ and $\phi$ are apolar when the contravariant
$b_\alpha^q u_\alpha^{p-q}$ vanishes identically for all values of $u$.


EXAMPLES

1. Let $f = a_x^3$, $\phi = u_\alpha^2$ be a ternary cubic and contravariant quadratic.
Then $a_x^2 a_y$ has 3 terms in $y$, while $\phi$ has 6 terms: $M = 3$, $N = 6$. Hence
three linearly independent conics exist which are apolar to $f$.

2. A binary quartic $a_x^4$ has two terms in $y$ among its cubic polars
$a_x^3 a_y$: $M = 2$, $N = 5$.

3. If $f = a_x^p$, $f' = b_x^q$, $\phi = u_\alpha^r$, then $ff'$ is apolar to $\phi$ if either $f$
or $f'$ is apolar to $\phi$.

4. The ternary $p$-ic $a_x^p$ is apolar to the $p$th power of a linear form
$u_y$ if $a_y^p = 0$.

Geometrically, if a curve of order $p$ passes through a point $y$, the point
reckoned $p$ times is a dual apolar form.

5. If $a_x^p$ is apolar to the $(p - 1)$th power of a linear form $u_y$ then
$a_y^{p-1} a_z = 0$ for all values of $z$. Geometrically, what is the point $y$?

[A double point on the curve.]

6. If $a_x^p$ is apolar to the $(p - 1)$th powers of $k$ different linear forms
the geometrical locus $a_x^p = 0$ has $k$ distinct double points.

This is true for all fields, ternary and higher.


7. Canonical Forms. 

Let

$$A_1, A_2, \dots, A_N; \qquad A_1', A_2', \dots, A_N' \qquad (9)$$

be the coefficients of a form $f$ before and after linear transformation
of its variables, so that each $A'$ is a linear function of
the $A$'s. Thus

$$A_i' = \theta_{i1} A_1 + \theta_{i2} A_2 + \dots + \theta_{iN} A_N, \qquad (10)$$

where each $\theta_{ij}$ is a function of the elements $\xi_1, \dots, \omega_n$ in the
matrix $M$ of the transformation.

Then a very pertinent question arises, how far can these
$A'$'s be arbitrarily assigned? In particular can some, and if so
how many, of them be zero? Can we, in fact, throw $f$ into a simpler
form by a suitable transformation?

Answers to these questions are ready to hand. Thus if capital
letters $X$, $Y$, $Z$, $X_i$ denote linear functions of the variables, and
in each case the general form is under discussion, then:

1. The binary cubic¹ can be written in the canonical form $X^3 + Y^3$.

2. The binary quintic, $X^5 + Y^5 + Z^5$.

3. The binary quartic, $X^4 + 6mX^2Y^2 + Y^4$.

4. The ternary cubic, $X^3 + Y^3 + Z^3 + 6mXYZ$.

5. The ternary quartic, $S_2S_3 - S_1^2$, in terms of three ternary
quadratics.

6. The quaternary cubic, $X_1^3 + X_2^3 + X_3^3 + X_4^3 + X_5^3$.

7. The ternary quartic cannot be written as the sum of five
fourth powers of linear forms.

Let us typify these problems by stating each in the form

$$f(x) = F(X) \qquad (11)$$

(or $F(S)$ in case (5)). Each $X$ is a linear function of the original
variables $x_i$; the total number of terms on the left side is $N$;
and this is an identity for all values of $x_i$. Hence $N$ separate
equations connect the coefficients of terms in $x$ on the left and
right.

Let $X = a_1x_1 + a_2x_2 + \dots + a_nx_n = a_x$, $Y = b_x$, $Z = c_x$, and
so on. Here we have $n$ undetermined coefficients for each $X$ or
$Y$ or $Z$, giving for cases (1), (2), (6) above $2 + 2$, $2 + 2 + 2$,
$4 + 4 + 4 + 4 + 4$ such unknowns. Now these exactly tally
with $N$ the number of terms in the given form $f(x)$.

Thus the binary cubic has four terms, and $X^3 + Y^3$ written
as $(a_1x_1 + a_2x_2)^3 + (b_1x_1 + b_2x_2)^3$ has four unknowns $a_1$, $a_2$, $b_1$, $b_2$.

In general we can solve these $N$ equations for $N$ unknowns and
obtain a finite number of sets of values $a_i$, $b_i$, ... which reduce
$f(x)$ to $F(X)$. This is called the test by counting constants. We
call all these unknowns, together with further coefficients $m$ in
the canonical form, the parameters. Parameters occurring in
$X$, $Y$, $Z$, &c., are implicit; others such as $m$ are explicit.

¹ Note that the form $X^3 + Y^3$ is inadmissible if the cubic has a repeated
factor.

Examples.

Case (3): $N = 5 = 2 + 2 + 1$. Here $m$ is the extra unknown.

Case (5): $N = 15 < 6 + 6 + 6$. Each $S$ has six unknown coefficients.
Presumably three can be arbitrary.

Case (7): $N = 15 = 3 + 3 + 3 + 3 + 3$. This passes the test.
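The tallies above can be checked mechanically. The following Python sketch assumes only the standard count $N = \binom{n+p-1}{p}$ for the coefficients of a general $n$-ary $p$-ic; the names `num_coefficients`, `passes_test` and `cases` are illustrative and do not come from the text, and the tally for case (4) is worked out here in the same manner as the others.

```python
from math import comb

def num_coefficients(n, p):
    """Number N of coefficients of a general n-ary p-ic (n variables, order p)."""
    return comb(n + p - 1, p)

def passes_test(n, p, parameters):
    """Counting constants: at least as many parameters as coefficients."""
    return parameters >= num_coefficients(n, p)

# (label, n, p, number of parameters in the proposed canonical form)
cases = [
    ("(1) binary cubic, two cubes", 2, 3, 2 + 2),
    ("(2) binary quintic, three fifth powers", 2, 5, 2 + 2 + 2),
    ("(3) binary quartic, with explicit m", 2, 4, 2 + 2 + 1),
    ("(4) ternary cubic, with explicit m", 3, 3, 3 + 3 + 3 + 1),
    ("(5) ternary quartic, three quadratics S", 3, 4, 6 + 6 + 6),
    ("(6) quaternary cubic, five cubes", 4, 3, 4 + 4 + 4 + 4 + 4),
    ("(7) ternary quartic, five fourth powers", 3, 4, 3 + 3 + 3 + 3 + 3),
]

for label, n, p, params in cases:
    verdict = "passes" if passes_test(n, p, params) else "fails"
    print(label, "N =", num_coefficients(n, p), "parameters =", params, verdict)
```

Every case passes, including case (7), which is why the count alone cannot decide whether a proposed canonical form is legitimate.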

8. Counting Constants is not Sufficient. 

Historically the ternary quartic, in case (7), provides the key 
to what follows, because at one time it was assumed to fall in 
with the general law. But an easier example of the inadequacy 
of this test is furnished by attempting to write the binary cubic
as $X^2Y$. Here $2 + 2 = 4$ satisfies the test, but it is insufficient
because $F(X)$ contains a repeated factor whereas the original
cubic need not. Something more is required; and it is supplied
by the Lasker-Wakeford theorem.¹


The Lasker-Wakeford Theorem. 

A form $F(X, m)$ which contains not less than $N$ parameters
among the $k$ auxiliary forms $X$ and $r$ explicit coefficients $m$, is,
or is not, a legitimate canonical form of $f(x)$ provided there is not,
or there always is, a form $\phi(u)$, dual to $f(x)$ and apolar to each of
the $k + r$ derivatives $\partial F(X, m)/\partial X$, $\partial F(X, m)/\partial m$.

The forms X need not necessarily be linear. 


Before proving this paradoxical and very curious theorem,
let us illustrate its scope in cases (3) and (5). For the binary
quartic

$$\tfrac14 \frac{\partial F}{\partial X} = X^3 + 3mXY^2, \qquad \tfrac14 \frac{\partial F}{\partial Y} = 3mX^2Y + Y^3, \qquad \tfrac16 \frac{\partial F}{\partial m} = X^2Y^2.$$

Treat $X$, $Y$ as variables, and $U$, $V$ as duals.

Now if $\phi = B_0U^4 + B_1U^3V + B_2U^2V^2 + B_3UV^3 + B_4V^4$ is a
quartic apolar to $X^2Y^2$, then (p. 263, Ex. 8) $B_2 = 0$. If it is
also apolar to a cubic $X^3$ (whose coefficients are 1, 0, 0, 0) then
each first polar is apolar (p. 265, Ex. 3).
Hence the apolar condition is $4U'B_0 + V'B_1 = 0$, so that
$B_0$, $B_1$ both vanish. If $\phi$ also is apolar to $Y^3$, then $B_3 = B_4 = 0$.
Hence an apolar form $\phi$ does not always exist, for it disappears
in the case when $m = 0$. By the theorem this is enough to
prove a legitimate form.

¹ E. Lasker, Math. Annalen, 58 (1904), 434-440. E. K. Wakeford, Proc.
London Math. Soc., 2, 18 (1918-19), 403-410.


EXAMPLE 

Prove $\phi$ non-existent for the cubic $X^3 + Y^3$.

9. Proof of the Theorem, 

Let the parameters, $\nu$ in number, be $l_1, l_2, \dots, l_\nu$, so that the
assumed identity (11) leads to $N$ $(\le \nu)$ equations of the type

$$A_i = f_i(l_1, l_2, \dots, l_\nu), \qquad (12)$$

where each $f_i$ is determined explicitly by expanding the canonical
form $F$. For instance, $a_1$ and $b_1$ are two of these $\nu$ parameters
in the above binary cubic, and $a_1^3 + b_1^3$ is one of the functions $f_i$.

Then if we can solve these $N$ equations for $N$ of the $\nu$
parameters in terms of the rest, the form $F$ is legitimately canonical.
If not, $F$ is uncanonical. Now a solution is, or is not, possible
according as a relation $\Phi(f) = 0$ does not, or does, exist: that is,
if the rank of the matrix $[\partial f_i/\partial l_r]$ is $N$, so that at least one $N$-rowed
Jacobian $\partial(f)/\partial(l)$ does not vanish identically for all values of its $N$
parameters, then a canonical form exists.

But in the uncanonical case a relation $\Phi(f) = 0$ exists for all
values of the $\nu$ parameters $l$. Thus

$$\frac{\partial f_1}{\partial l_r}\frac{\partial \Phi}{\partial f_1} + \frac{\partial f_2}{\partial l_r}\frac{\partial \Phi}{\partial f_2} + \dots + \frac{\partial f_N}{\partial l_r}\frac{\partial \Phi}{\partial f_N} = 0 \qquad (r = 1, 2, \dots, \nu).$$

Now this is a lineo-linear relation of type (8) between the
$N$ coefficients $\partial f_1/\partial l_r, \dots, \partial f_N/\partial l_r$ of a form $\chi_r$ and the coefficients
$\frac{1}{c_1}\frac{\partial \Phi}{\partial f_1}, \dots, \frac{1}{c_N}\frac{\partial \Phi}{\partial f_N}$ of a dual form $\phi(u)$. And owing to
relations (12), this form $\chi_r$ is precisely $\partial f(x)/\partial l_r$.

Then unless the $\nu$ forms $\chi_r$ possess an apolar form $\phi(u)$ for all
values of the parameters, $\Phi(f) = 0$ cannot exist and the form
$F$ is therefore canonical.



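Finally, if any of the parameters are implicit, say $l = a_i$
in a linear form

$$X = a_1x_1 + a_2x_2 + \dots + a_nx_n,$$

then $\partial f(x)/\partial l = \partial F(X)/\partial l = x_i\,\partial F/\partial X$.

Hence for each value of $i$, $x_i\,\partial F/\partial X$ is apolar to $\phi(u)$. This requires
$\partial F/\partial X$ to be apolar to $\phi(u)$. Conversely if $\partial F/\partial X$ is apolar, so
is $\partial f(x)/\partial l$. Similarly if $X$ is a form in $x_1, \dots, x_n$ of higher order.
This proves the theorem.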
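The rank criterion in this proof lends itself to a direct numerical check. The Python sketch below is an illustration under stated assumptions, not the author's method: forms are stored as dictionaries from exponent tuples to coefficients, the columns of the Jacobian are taken to be the coefficient vectors of the forms $x_i\,\partial F/\partial X$ (numerical factors dropped), and the rank is evaluated at one particular choice of the linear forms $X$. All function names, and the five ternary linear forms, are hypothetical choices made for the example. A rank equal to $N$ at one choice shows the form is canonical; a smaller rank at one choice is only indicative, except for $X^2Y$, where all choices of independent $X$, $Y$ are equivalent.

```python
from fractions import Fraction
from itertools import product

def poly_mul(f, g):
    """Product of two polynomials stored as {exponent tuple: coefficient}."""
    out = {}
    for ef, cf in f.items():
        for eg, cg in g.items():
            e = tuple(i + j for i, j in zip(ef, eg))
            out[e] = out.get(e, 0) + cf * cg
    return out

def linear_form(coeffs):
    """The linear form coeffs[0]*x_1 + coeffs[1]*x_2 + ... as a polynomial."""
    n = len(coeffs)
    return {tuple(int(j == i) for j in range(n)): c for i, c in enumerate(coeffs)}

def rank(rows):
    """Rank of a matrix of Fractions by Gaussian elimination."""
    rows = [list(r) for r in rows]
    rk = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(rk, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for i in range(rk + 1, len(rows)):
            factor = rows[i][col] / rows[rk][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[rk])]
        rk += 1
    return rk

def span_rank(polys, nvars, degree):
    """Dimension of the space spanned by some forms of the given order."""
    basis = [e for e in product(range(degree + 1), repeat=nvars) if sum(e) == degree]
    return rank([[Fraction(p.get(e, 0)) for e in basis] for p in polys])

def columns(derivatives, variables):
    """The forms x_i * dF/dX, one for each implicit parameter of each X."""
    return [poly_mul(d, x) for d in derivatives for x in variables]

# Binary cases, taking X = x_1 and Y = x_2.
x, y = linear_form([1, 0]), linear_form([0, 1])
X2, Y2, XY = poly_mul(x, x), poly_mul(y, y), poly_mul(x, y)
# X^3 + Y^3: the derivatives are multiples of X^2 and Y^2 (N = 4).
rank_sum_of_cubes = span_rank(columns([X2, Y2], [x, y]), 2, 3)
# X^2 Y: the derivatives are multiples of XY and X^2 (N = 4).
rank_repeated_factor = span_rank(columns([XY, X2], [x, y]), 2, 3)

# Ternary quartic as five fourth powers: the derivatives are the cubes X_i^3 (N = 15).
forms = [linear_form(c) for c in [(1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 1), (1, 2, 3)]]
cubes = [poly_mul(L, poly_mul(L, L)) for L in forms]
xyz = [linear_form(c) for c in [(1, 0, 0), (0, 1, 0), (0, 0, 1)]]
rank_five_powers = span_rank(columns(cubes, xyz), 3, 4)

print(rank_sum_of_cubes, rank_repeated_factor, rank_five_powers)
```

The three ranks are 4, 3 and 14. So $X^3 + Y^3$ attains the full rank $N = 4$, while $X^2Y$ falls short of 4 and the five fourth powers fall short of 15 by exactly one, the deficiency being the doubled conic of Example 1 below.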


EXAMPLES 

1. A general ternary quartic cannot be expressed as the sum of five
fourth powers because a quartic exists apolar to the five special cubic forms
$\tfrac14\,\partial F/\partial X_i = X_i^3$.

Proof.

Through five points a conic can be drawn. This conic counted twice
has these five for double points. Hence by Ex. 6, p. 265, a quartic $a_x^4$
apolar to five cubes exists, and dually a quartic $u_\alpha^4$ apolar to five cubes
$X_i^3$ exists.

2. A general binary form of order $2k - 1$ can be expressed as the sum
of $k$ linear forms, each raised to the same power $2k - 1$.

3. Any binary $p$-ic $f$, apolar to every $p$-ic $\phi$ which has a linear factor
$X$ repeated $\lambda$ times, must itself contain this factor $X$ repeated $p - \lambda + 1$
times.

[Combine Ex. 3, p. 263; (iii), p. 264; and §6. Treating $X$, and another
linear form $Y$, as new independent variables, then the last $\lambda$ coefficients of
$\phi$ are zero: whence the last $p - \lambda + 1$ of $f$ must also vanish.]



CHAPTER XVIII 


Invariant Equations and Gram’s Theorem 


1. Expression of a Gradient by Coefficients of Covariants.

Let $f(A, x)$ typify one or more ground forms whose typical
coefficient $A$ is symbolized by a product of factors $a_1, a_2, \dots, a_n$, and whose
transformed coefficients are indicated by an accent. Then (§6,
p. 202) a single-term product $P'$ of coefficients $A'$ is symbolized
by a product of factors $a_i'$, and therefore of $a_\xi, a_\eta, \dots, a_\omega$.

Now consider the $n$ columns $\xi, \eta, \dots, \omega$ of the matrix $M$
which transforms $x$ to $x'$, as a set of $n$ cogredient points, then
$P'$ is at once the symbol of a concomitant for the ground forms
and these $n$ points, because it consists entirely of inner products
such as $a_\xi$. Further, let $P'$ be expanded by a Gordan-Capelli
series as

$$P' = \sum \Delta_0 P_0 + |M| \sum \Delta_1 P_1 + |M|^2 \sum \Delta_2 P_2 + \dots \qquad (1)$$

where each $P_i$ denotes a concomitant, and $\Delta_i$ is an aggregate of
polar operators $\left(\eta \frac{\partial}{\partial \xi}\right)$, &c., involving pairs from among $\xi, \eta, \dots, \omega$.

Each $P_i$ involves $(n - 1)$ of these cogredient sets, and $|M|$
denotes $(\xi\eta \dots \omega)$. We provisionally call each $P_i$ a covariant.
For binary forms ($n = 2$) this has the usual meaning, because
$P_i$ has now only one set $\xi$.

Identity (1) is true for all values of $\xi_1, \dots, \omega_n$. Taking the
unit matrix in particular, when $\xi_1 = \eta_2 = \dots = \omega_n = 1$ and all
the rest vanish, we obtain $a_\xi = a_1' = a_1$, so that each $P_i$ becomes
a coefficient in a certain covariant $P_i$, $|M|$ becomes unity, and
$P'$ becomes the original product of actual coefficients $A$ (cf.
§1, p. 226).

Finally, if we select a number of terms $P$, isobaric and therefore
homogeneous, in each $\xi$, $\eta$, &c., and add the results, we obtain
the theorem:

Any gradient can be expressed as the sum of coefficients in
covariants involving $n - 1$, or fewer cogredient variables.

Examples , — 

1 . For binary cubic forms a„ a^, x^, *2)3, (60, b^, 63 K x^, 

let P ~ a^bi. Then 

2P' = 

2. Let $\phi(a)$ be a polynomial of binary coefficients such that when
$\phi(a) = 0$ so also $\phi(a') = 0$ for all values of $\xi_1, \xi_2, \eta_1, \eta_2$. By taking the
diagonal matrix ($\xi_1 = \xi$, $\eta_2 = \eta$, $\xi_2 = \eta_1 = 0$) show that $\phi(a)$ must be isobaric.

3 . The Gordan-Capelli series for 9 (a') is 

9 M = AoPo -f + . . . 

where the typical co-factor of is a polar of a covariant, say Except 
for a numerical factor the typical term is Putting 

= YJ2 = 1, ^2.= = 0, then 9 (a') = 9(a) and the typical term is 

which is a coefficient in the covariant 
Hence 9(a) = 0 and also 9 (a') = 0 for aU transformations, a certain 
set of covariants Po, Pv . . , must vanish identically, 

2. Invariant Equations. 

Suppose a relation $\phi(A) = 0$ to exist between the coefficients
of one or more general ground forms, in such wise that exactly
the same relation $\phi(A') = 0$ exists for the corresponding coefficients
after an arbitrary linear transformation. Then this
relation is called an invariant equation. Geometrically, such relations
are called projective relations.

It is clear that this is the kind of result which frequently
occurs in analytical geometry (cf. p. 132); but it is by no
means obvious that the case of such equations is covered by our
invariant theory, because the condition $I(A') = |M|^w I(A)$ of the
latter is more stringent than to say $\phi(A) = 0 = \phi(A')$. Nevertheless
they are closely related, as the following theorem
demonstrates.

3. Gram's Theorem.

If an invariant equation exists it belongs to a system of $m$ such
equations which specify that the $m$ coefficients of a certain covariant,
in $n$ or less variables cogredient with $x$, vanish. If $m = 1$ the equation
specifies the vanishing of an invariant.

Conversely, if a covariant vanishes identically it continues to do
so after linear transformation.

Proof.

This converse is obvious, while the direct theorem follows
from the result of the last section.

For if $\phi(A) = 0$ is an invariant equation then by definition
$\phi(A')$ vanishes identically for all values of the transformation
coefficients $\xi_1, \dots, \omega_n$. We construct $\phi(A')$ as a function of
the $A$'s and the $\xi$'s, $\eta$'s, ..., expressing it in its lowest terms as

$$\phi(A') = C_1\phi_1(A) + C_2\phi_2(A) + \dots + C_m\phi_m(A), \qquad (2)$$

where each $C_i$ is a function of $\xi, \eta, \dots$ only. As $\phi(A')$ vanishes
for all values of $\xi, \eta, \dots$, it follows that

$$\phi_1(A) = 0, \quad \phi_2(A) = 0, \quad \dots, \quad \phi_m(A) = 0. \qquad (3)$$

Thus we deduce $m$ linearly independent conditions as a consequence
of an invariant equation $\phi(A) = 0$. Interchanging the
rôles of $A$ and $A'$ we deduce from the inverse transformation,

$$\phi_1(A') = 0, \quad \phi_2(A') = 0, \quad \dots, \quad \phi_m(A') = 0. \qquad (4)$$

Thus we have a system of $m$ invariant equations. But so far
we have not shown that they include $\phi(A)$. This follows by
substituting $\xi_1 = \eta_2 = \dots = \omega_n = 1$ from the identical transformation,
making $\phi(A') = \phi(A)$. For now (2) becomes

$$\phi(A) = \lambda_1\phi_1(A) + \lambda_2\phi_2(A) + \dots + \lambda_m\phi_m(A), \qquad (5)$$

where each $\lambda$ is numerical.

Further, by making the transformation matrix $M$ a diagonal
matrix (Ex. 1, p. 101) with zeros everywhere but on the diagonal
$\xi_1, \eta_2, \dots, \omega_n$, and substituting these values of $\xi, \dots, \omega$ in (2)
we infer that parts of $\phi(A')$ homogeneous separately in each
$\xi, \eta, \dots$ satisfy the typical invariant equation condition. Hence
we lose no generality by assuming $\phi(A)$ is homogeneous and isobaric
in $A$. In fact $\phi(A)$ is a gradient.

Finally by the Gordan-Capelli expansion for $\phi(A')$ of the
preceding section we have an explicit form for condition (2),
which at once shows that the $m$ functions $\phi_i(A)$ are the several
distinct coefficients in a covariant, or combination of covariants.

In particular if $m = 1$, then, by (5), $\phi(A) = \lambda_1\phi_1(A)$ and by
(2), $\phi(A') = C_1\phi_1(A)$, showing at once that $\phi(A)$ is an invariant.

This proves the theorem.

4. Grace's Theorem.

Quite recently Mr. J. H. Grace¹ has developed this theory
with more particular reference to these vanishing covariants. 
The question which is put now runs as follows: 

What is the most general polynomial $\phi(A)$ of degree $i$ in the
coefficients $A$ which vanishes when the ground forms have an assigned
projective property?

The answer is simple, namely:

$\phi(A)$ is the sum of a number of parts each of which is a coefficient
in a covariant of the forms, and all such covariants vanish in virtue
of the assigned property.

Such covariants have already been found; they can be taken 
as linearly independent of degree i. But it can further be proved 
that the coefficients themselves are linearly independent. In 
fact, there cannot be a linear relation between any coefficients
of a set of linearly independent covariants, for if there were,
the operation of writing $a_\xi$ for $a_1$, $a_\eta$ for $a_2$, &c., would immediately
give a linear relation between the covariants themselves.

This theory is applicable to forms of all types already con- 
templated, including multiple fields. Little or nothing is known 
about these last, and in fact the whole subject presents many 
opportunities for further investigation. 

¹ Journal London Math. Soc., (1928), 34-38.

Examples.

1. To find the necessary and sufficient conditions for a ternary cubic to
be a perfect cube.

Let $a_x^3 = b_x^3$ be the symbolic form of the cubic which is to be a perfect
cube $(p_1x_1 + p_2x_2 + p_3x_3)^3$ where the coefficients $p_i$ are actual numbers,
and $a_{ijk}$ is the actual coefficient in non-symbolic form. Thus we have a
number of non-symbolic equations

$$a_{ijk} = p_ip_jp_k \qquad (i, j, k = 1, 2, 3).$$

This is secured by eliminating $p_1$, $p_2$, $p_3$ in every possible way, giving

$$a_{ijk}^2 = a_{iij}a_{jkk}, \qquad a_{iii}a_{jjj} = a_{iij}a_{ijj}.$$

Symbolically this gives two types of condition

$$a_1a_2a_3 \cdot b_1b_2b_3 = a_1^2a_2 \cdot b_2b_3^2, \qquad a_1^3 \cdot b_2^3 = a_1^2a_2 \cdot b_1b_2^2.$$

The first leads by Gram's theorem to the covariant

$$a_xa_ya_z\,b_xb_yb_z - a_x^2a_y\,b_yb_z^2 = a_xa_yb_yb_z[a_zb_x - a_xb_z] = 0.$$

If $u_i = (zx)_{jk}$ we can write this

$$(abu)\,a_xa_y\,b_yb_z,$$

and by interchange of equivalent symbols this becomes

$$\tfrac12(abu)(ab \mid xz)\,a_yb_y = \tfrac12(abu)(abv)\,a_yb_y,$$

where $v$ is cogredient to $u$. This concomitant is a polar of
$\tfrac12(abu)^2a_xb_x = \tfrac12\Theta$.

The other condition gives the covariant

$$a_x^3b_y^3 - a_x^2a_yb_xb_y^2 = a_x^2b_y^2(ab \mid xy) = \tfrac12(abu)(a_x^2b_y^2 - a_y^2b_x^2) = \tfrac12(abu)^2(a_xb_y + a_yb_x),$$

which is a polar with regard to $y$ of the same form $\Theta$.

Hence the required condition for $a_x^3$ to be a perfect cube is that the
mixed concomitant $\Theta$ should vanish identically.

2. The cubic in $n$ variables is a perfect cube if $(ab \mid xy)^2a_xb_x$ vanish
identically.

3. The quadratic $a_x^2$ in $n$ variables is a perfect square if $(ab \mid xy)^2$
vanish identically.

4. The binary $n$-ic is a perfect $n$th power if the Hessian vanish
identically: and consequently all its concomitants except itself vanish.

5. For the binary $n$-ic which contains a factor repeated $n - 1$ times
all covariants of grade four, i.e. such as contain the symbolic factor $(ab)^4$,
vanish. (Grace.)

6. Let $f = a_x^n = b_x^n$ be a binary $n$-ic. Show that a complete set of
covariants of degree two is

$$f^2, \quad H_1 = (ab)^2a_x^{n-2}b_x^{n-2}, \quad H_2 = (ab)^4a_x^{n-4}b_x^{n-4}, \quad H_3 = \text{&c.}$$

7. If $l_x$ is an actual linear factor, repeated $n - i$ times in $f$, so that

$$f \equiv 0 \mod l_x^{n-i},$$

then all covariants of degree two vanish except

$$f^2, H_1, H_2, \dots, H_i. \quad \text{(Grace.)}$$
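The elimination relations of Example 1 can be tried on actual numbers. The Python sketch below is illustrative only: it takes $a_{ijk}$ to be the coefficient in the symmetric expansion $\sum a_{ijk}x_ix_jx_k$, and the names `cube_coefficients` and `satisfies_relations` are hypothetical. It checks that a genuine cube satisfies both types of relation, and that the cubic $x_1^3 + x_2^3$ does not.

```python
from itertools import product

def cube_coefficients(p):
    """Coefficients a_ijk = p_i p_j p_k of the cube (p_1 x_1 + p_2 x_2 + p_3 x_3)^3."""
    return {(i, j, k): p[i] * p[j] * p[k] for i, j, k in product(range(3), repeat=3)}

def satisfies_relations(a):
    """Both types of relation, for every choice of the indices i, j, k."""
    return all(
        a[i, j, k] ** 2 == a[i, i, j] * a[j, k, k]
        and a[i, i, i] * a[j, j, j] == a[i, i, j] * a[i, j, j]
        for i, j, k in product(range(3), repeat=3)
    )

perfect_cube = cube_coefficients((2, -3, 5))

# x_1^3 + x_2^3: only a_111 and a_222 are non-zero.
two_cubes = {idx: 0 for idx in product(range(3), repeat=3)}
two_cubes[0, 0, 0] = 1
two_cubes[1, 1, 1] = 1

print(satisfies_relations(perfect_cube), satisfies_relations(two_cubes))
```

The second cubic fails because $a_{111}a_{222} = 1$ while $a_{112}a_{122} = 0$.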

5.¹ Invariants as Elimination Results.

¹ Gram, Math. Annalen, loc. cit.

Let $f = a_x^p$ be an $n$-ary $p$-ic whose actual coefficients are now
written

$$a_1, a_2, \dots, a_N. \qquad (6)$$

Further, let the linear transformation $x \to x'$, of matrix $[e_{ij}]$,
induce a transformation $a \to b$ on these coefficients, so that the
new coefficients are given by $N$ linear equations


— ^ (7) 


Then, if $f$ has two polynomial invariants $I(a)$, $K(a)$ of the same
weight, they give rise to an absolute invariant

$$i(a) = \frac{I(a)}{K(a)}. \qquad (8)$$

Thus

$$\frac{I(a)}{K(a)} = \frac{I(b)}{K(b)}, \quad \text{or} \quad I(a)K(b) - K(a)I(b) = 0. \qquad (9)$$

This last is a polynomial equation in the coefficients $a_i$, $b_i$ which
by definition must vanish identically for all values of $e_{ij}$ when
each $b_i$ is expressed in terms of the original $a$'s. In other words
(9) is the result of eliminating the $n^2$ coefficients $e_{ij}$ from the $N$
equations (7). Each absolute invariant is an elimination result,
or let us simply say a resultant, of the system (7) regarding $e_{ij}$ as
the variables.

For this to happen in general the number $N$ of equations
(7) must exceed $n^2$, else the $e$'s cannot be eliminated. So we
assume $N > n^2$. But we can go further and prove conversely
that

Every resultant derived by eliminating the $n^2$ coefficients $e_{ij}$
from the $N$ transformation equations furnishes, either a system of
invariant equations for the coefficients $a_i$ of the ground form $f$, or a
relation $i(a) = i(b)$ expressing the equality of two absolute invariants.

Proof.

Let such a resultant be expressed as a polynomial in each
$a_i$ and $b_i$ as

$$R(a, b) = 0. \qquad (10)$$

There are two cases to consider. Either one set of coefficients
is absent from $R$, or both are present.

First we can take $R$ to be $R(a)$, excluding $b$. By interchanging
$a$, $b$ and using the inverse transformation and carrying out
identically the same steps we should have arrived at the analogous
result $R(b) = 0$. Hence $R(a) = 0$ belongs to an invariant system
of equations, and by Gram's theorem certain invariants or covariants
of $f$ vanish identically. In this case $f$ is not a general
$n$-ary $p$-ic.

Secondly, $R$ contains coefficients of both types $a$ and $b$. We
write the resultant explicitly as

$$R \equiv A_0B_0 + A_1B_1 + \dots + A_\nu B_\nu = 0, \qquad (11)$$

where, for each value of $i$, $A_i$ is a homogeneous polynomial of
degree $i$ in the coefficients $a$, and $B_i$ likewise in $b$. Further we
suppose each such resultant to be in its simplest terms and
irresoluble into factors.

Let a new linear transformation induce the coefficient
transformation $a \to c$ of matrix $[\theta_{ij}]$. Then between sets $(a)$ and
$(c)$ there will be a corresponding resultant

$$R' \equiv A_0C_0 + A_1C_1 + \dots + A_\nu C_\nu = 0, \qquad (12)$$

each $C_i$ being the same function of $(c)$ that $B_i$ is of $(b)$. And
since $(a)$, $(b)$, $(c)$ are connected linearly, there will be a matrix
$J$ for the direct transformation $b \to c$, with its corresponding
resultant

$$R'' \equiv B_0'C_0 + B_1'C_1 + \dots + B_\nu'C_\nu = 0, \qquad (13)$$

where each $B_i'$ is analogous to the original $A_i$.

Solving (11) and (12) for $A_0$ and equating the results, we have

$$A_1\frac{B_1}{B_0} + \dots + A_\nu\frac{B_\nu}{B_0} = A_1\frac{C_1}{C_0} + \dots + A_\nu\frac{C_\nu}{C_0}, \qquad (14)$$

which along with each preceding relation must be an identity
for all values of the elements $e_{ij}$, $\theta_{ij}$. Then if we express
each $C_i$ as a function of $(b)$ and $(\theta)$, the result (14) gives a relation
between sets $(a)$, $(b)$ also involving an arbitrary set $(\theta)$. As the
$a$'s only enter (14) by way of the $\nu$ homogeneous polynomials
$A_i$, and, by (7), the $b$'s are arbitrary, it follows that the coefficients
of $A_i$ are equal on each side. Thus

$$\frac{B_1}{B_0} = \frac{C_1}{C_0}, \quad \frac{B_2}{B_0} = \frac{C_2}{C_0}, \quad \dots$$

Hence by interchanging the rôles of $(a)$ and $(c)$ we have

$$\frac{A_i}{A_0} = \frac{B_i}{B_0}, \qquad i = 1, 2, \dots, \nu,$$

each of which is a relation of type

$$\frac{\phi(a)}{\psi(a)} = \frac{\phi(b)}{\psi(b)}.$$

In other words it is an equality between two absolute invariants
$i(a) = i(b)$.

Corollary I. The numerator and the denominator of an
absolute invariant $\phi(a)/\psi(a)$ are relative invariants.

For if we write the absolute relation as

$$\phi(b) = \rho\,\phi(a), \qquad \psi(b) = \rho\,\psi(a),$$

then $\phi(b)$ and $\psi(b)$ are polynomials in both sets $a$ and $e_{ij}$. Consequently
$\rho$ is a rational function

$$\frac{p(a, e)}{q(a, e)}$$

certainly involving the set $e$ and possibly $a$. If we suppose $p/q$
to be in its lowest terms, then, in order to make $\rho\,\phi(a)$ a polynomial,
$q(a, e)$ must be a factor of $\phi(a)$, and similarly of $\psi(a)$.
This is impossible, if $\phi(a)/\psi(a)$ is originally in its lowest terms,
unless $q(a, e)$ is a mere constant factor. Hence $\rho$ is a polynomial.

Further, since each $b$ is linear in the set $a$, $\phi(b)$ and $\phi(a)$ are
of the same degree in $a$. Hence the degree of $\rho$ in $a$ is zero, so that
$\rho$ depends solely on the set $e$. By the definition (§2, p. 169)
this proves the result.

Corollary II. The theorem holds for any simultaneous system
of ground forms.

6. The Equivalence Problem. 

When a linear transformation $x \to x'$ with non-vanishing
determinant $|e_{ij}| = |M|$ turns a form $f$ into $f'$ and consequently
the inverse transformation $x' \to x$ turns $f'$ into $f$, the two forms
are said to be equivalent. Manifestly if $f$ is equivalent to $f'$
and $f'$ to $f''$ then $f$ is also equivalent to $f''$.

When the transformation changes $f$ to $\rho f'$ where $\rho$ is a non-zero
constant, the equations $f = 0$, $f' = 0$ are said to be equivalent.
It is easy to adapt this last to the original case by multiplying
each $e_{ij}$ by the same constant $\sqrt[p]{\rho}$.

The results of the foregoing sections show the necessary and
sufficient conditions for two forms to be equivalent. The coefficients
$(a)$ and $(b)$ of the equivalent forms must either both be
general, or else both satisfy the same particular conditions, and
also their corresponding absolute invariants must be equal. By
Gram's theorem this first entails the identical vanishing of the
same covariants.

7. Extension of Stroh’s Lemma. 

Recently Mr. J. H. Grace¹ has used the preceding methods
of canonical forms, with great effect, for theorems which used
to be very difficult to prove, though of considerable importance
in applying the fundamental identities

$$(bc)a_x + (ca)b_x + (ab)c_x = 0,$$
$$(bcd)a_x + (cad)b_x + (abd)c_x + (bac)d_x = 0$$

to binary and ternary forms.

For if $\lambda_1, \lambda_2, \dots, \lambda_r$ are positive integers, each not exceeding
$p$, such that

$$\lambda_1 + \lambda_2 + \dots + \lambda_r = (r - 1)(p + 1), \quad r > 2, \qquad (1)$$

then a canonical form of $f(\xi)$, any general or special binary $p$-ic in
$\xi, \eta$, is

$$f(\xi) = X_1^{\lambda_1}P_1 + X_2^{\lambda_2}P_2 + \dots + X_r^{\lambda_r}P_r, \qquad (2)$$

where $X_1, X_2, \dots, X_r$ are $r$ given distinct linear forms in $\xi, \eta$
and $P_i$ is a form of order $p - \lambda_i$. For a given set of $X$'s this set
of $P$'s is unique.

Proof.

Regarding the coefficients in $P_i$ as intrinsic parameters of
the proposed canonical form, we note that $P_i$ supplies $p_i$ such
parameters where $p_i = p - \lambda_i + 1$. So the $P$'s supply in all,

$$\sum_{i=1}^{r}(p - \lambda_i + 1) = r(p + 1) - \sum \lambda_i = p + 1, \qquad (3)$$

a number which tallies with the number of coefficients in $f(\xi)$.

Again, since (2), when written in full, is linear in these parameters,
not only does it provide $p + 1$ equations for the $p + 1$
parameters, but the equations also are linear. Hence the
canonical form is unique, if it exist.

Now suppose $\phi(\xi)$ is a binary $p$-ic always apolar to
$\sum_{i=1}^{r} X_i^{\lambda_i}P_i$. Then, if all $p - \lambda_i + 1$ coefficients in each $P_i$ are arbitrary,
$\phi(\xi)$ will be apolar to $X_i^{\lambda_i}P_i$ singly, and therefore (Ex. 3, p. 269)
will have $X_i^{p_i}$ as factor. Hence $\phi(\xi)$ will contain distinct factors
$X_1^{p_1}, X_2^{p_2}, \dots$ giving by (3) a total order greater than $p$,
its own order; which is impossible. Consequently no such apolar
form exists, and the forms $X_i^{\lambda_i}P_i$ are entirely linearly independent,
so that the canonical form (2) is justified.

¹ Proc. Cambridge Phil. Soc., 24 (1928), 218-222.

Corollary I. If $r = 3$, we obtain Stroh's lemma:

If $\xi$, $\eta$, $\zeta$ are three quantities whose sum is zero, and $\lambda$, $\mu$, $\nu$ three
positive integers ($\le p$) such that

$$\lambda + \mu + \nu = 2p + 2$$

then any homogeneous polynomial of order $p$ in $\xi$, $\eta$, $\zeta$ can be
expressed uniquely in the form

$$\xi^\lambda P + \eta^\mu Q + \zeta^\nu R.$$

For let $X_1 = \xi$, $X_2 = \eta$, $X_3 = \zeta = -\xi - \eta$, and $r = 3$ in the
above theorem.

Corollary II. Further, if as is possible, $\lambda > \tfrac23 p$, $\mu > \tfrac23 p$,
$\nu > \tfrac23 p$ we obtain what is known as Jordan's lemma.

Similar methods apply¹ to four quantities $\xi$, $\eta$, $\zeta$, $\omega$, whose
sum is zero. Any $p$-ic in $\xi$, $\eta$, $\zeta$, $\omega$ can be expressed (not
necessarily uniquely) as

$$\xi^\lambda P + \eta^\mu Q + \zeta^\nu R + \omega^\rho S$$

where $\lambda + \mu + \nu + \rho = 2p + 3$. For five variables $2p + 3$
changes into $2p + 4$, and so on.

¹ Loc. cit.

EXAMPLE

Prove the identity

$$3(bc)(ca)(ab)\,a_xb_xc_x = (bc)^3a_x^3 + (ca)^3b_x^3 + (ab)^3c_x^3.$$
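The two fundamental identities quoted at the head of this section, and the identity of the Example, hold for any numerical values given to the symbols, so they can be spot-checked directly. The Python sketch below does this; the helper names `bracket2`, `bracket3` and `inner` and the sample values are illustrative assumptions, and a numerical check of this kind is of course no substitute for a proof.

```python
def bracket2(a, b):
    """Binary bracket (ab) = a_1 b_2 - a_2 b_1."""
    return a[0] * b[1] - a[1] * b[0]

def bracket3(a, b, c):
    """Ternary bracket (abc), the determinant with rows a, b, c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))

def inner(a, x):
    """Inner product a_x = a_1 x_1 + a_2 x_2 + ..."""
    return sum(ai * xi for ai, xi in zip(a, x))

# Binary identity (bc)a_x + (ca)b_x + (ab)c_x = 0.
a, b, c, x = (2, 3), (-1, 4), (5, -2), (7, 1)
xi = bracket2(b, c) * inner(a, x)
eta = bracket2(c, a) * inner(b, x)
zeta = bracket2(a, b) * inner(c, x)
binary_sum = xi + eta + zeta

# Since xi + eta + zeta = 0, the cubes satisfy xi^3 + eta^3 + zeta^3 = 3 xi eta zeta.
cubes_difference = xi**3 + eta**3 + zeta**3 - 3 * xi * eta * zeta

# Ternary identity (bcd)a_x + (cad)b_x + (abd)c_x + (bac)d_x = 0.
A, B, C, D, X = (1, 2, 0), (0, -1, 3), (4, 1, 1), (2, 5, -3), (1, -2, 6)
ternary_sum = (bracket3(B, C, D) * inner(A, X) + bracket3(C, A, D) * inner(B, X)
               + bracket3(A, B, D) * inner(C, X) + bracket3(B, A, C) * inner(D, X))

print(binary_sum, cubes_difference, ternary_sum)
```

Each of the three printed values is zero. The Example is thus the case $\lambda = \mu = \nu$ of a relation among three quantities whose sum vanishes.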



CHAPTER XIX 

Geometrical Interpretations of Algebraic Forms 

1. Homogeneity and Correspondence. 

In Chapters I and V allusions have been made to Cartesian 
and homogeneous co-ordinates. We now seek a closer connexion 
between the geometry and the algebra. The straight line, with 
its totality of points, illustrates the binary theory — a finite set 
of n points picturing the binary n-ic — while the points of a plane 
illustrate ternary forms, the points of threefold space, quaternary 
forms, and so on. Moreover it is worth while pondering for a 
moment on two relevant ideas which help to make the general 
setting of the theory a little clearer. One is the idea of homogeneity,
and the other is that of correspondence.

(1) The practice of centuries in algebra has made it abundantly 
clear that the homogeneous polynomial, or form, is much easier 
to handle than the non-homogeneous. And although in ordinary 
Cartesian geometry a start is usually made with the non-homo- 
geneous expressions, as in the general equation of the second 
degree for a conic, in the natural, but inflated, hope that such 
a course is more comprehensive and general, very soon we return 
to the homogeneous once more, either by paying attention to 
the terms of highest degree 

$$ax^2 + 2hxy + by^2,$$

or by introducing a third variable $z$, and considering what is
in effect the ternary quadratic form

$$S = ax^2 + by^2 + cz^2 + 2fyz + 2gzx + 2hxy = (a, b, c, f, g, h \between x, y, z)^2.$$

A return to Cartesian co-ordinates can then be made at any
moment merely by putting $z = 1$.

Likewise the general polynomial $F$ of order $p$ in $n - 1$ variables
can be written

$$F = U_0 + U_1 + \dots + U_p,$$

where each term $U$ is a homogeneous form in the variables, of
order indicated by the suffix. So if $t$ is a new variable, this form
is made homogeneous in $n$ variables by writing

$$U_0t^p + U_1t^{p-1} + \dots + U_p,$$

taking $F$ as the special case of this, when $t = 1$. So in discussing
the $n$-ary form, that is the polynomial homogeneous in $n$ variables,
we are including the apparently more general non-homogeneous
form.
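The passage between the two kinds of polynomial is purely mechanical, as the following Python sketch suggests. It assumes a polynomial is stored as a dictionary from exponent tuples to coefficients, with the new variable $t$ appended as the last exponent; the function names and the sample conic are illustrative, not taken from the text.

```python
def homogenize(poly):
    """Multiply each term by the power of a new last variable t that raises it to the top order."""
    order = max(sum(e) for e in poly)
    return {e + (order - sum(e),): c for e, c in poly.items()}

def dehomogenize(form):
    """Put the last variable equal to 1, merging any terms that then coincide."""
    out = {}
    for e, c in form.items():
        out[e[:-1]] = out.get(e[:-1], 0) + c
    return out

# The general equation of the second degree with sample coefficients:
# x^2 + 4xy + 3y^2 + 6x + 8y + 5.
conic = {(2, 0): 1, (1, 1): 4, (0, 2): 3, (1, 0): 6, (0, 1): 8, (0, 0): 5}
ternary = homogenize(conic)

print(ternary)
print(dehomogenize(ternary) == conic)
```

Every term of `ternary` has total order two, and putting the third variable equal to 1 recovers the original Cartesian expression.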

It is only when $U_0 + U_1 + U_2 + \dots$ is an endless series
that homogeneity breaks down; and it is just here that we step
over the clear border line between algebra and analysis. Algebra
is, in fact, the study of the finite, wherein it is totally different
from elementary arithmetic on the one hand and analysis on the
other. Thus at once we begin to express ourselves algebraically
when we write $x$, a single finite symbol for the endless choice
of positive integers which form the ambitious subject matter
of elementary arithmetic.

It should, however, be remarked that, in analysis, homogeneity
can be retained; but only at the expense of another algebraic
feature, the polynomial. For example, the series

$$1 + \frac{x}{y} + \dots$$

is homogeneous in two variables $x$, $y$; and each term is rational,
but not integral.


(2) Correspondence. This is perhaps only another name for
the same idea. Just as $x$ is one symbol for a numerous class
of things, so an algebraic matrix, form, equation, or syzygy is
one symbol for many phenomena in quite divergent fields of
thought. A very simple familiar example of this is given by the
two geometrical figures on the following page, one with points
in line, the other with coplanar lines through a common point.
As objects to be gazed upon what could be more different?
Yet they contain the same idea; and algebraically they are
implied by the same symbols.

In short it may be said that there is not merely one but a
definite system of geometrical and even physical phenomena
associated with each algebraic statement, and no hard and fast
rule holds for the geometrical interpretation of the algebra.

2. Principle of Duality. 

The above two figures, of points in line, and lines through a 
point, illustrate in geometry what is called the principle of duality 
or reciprocation. This principle immediately fits in with the duality 
already seen in the algebra — in the rows and columns of a matrix, 
in the theory of reciprocal matrices and determinants, and briefly 
in all that is comprised in the terms cogredience and contra- 
gredience. It amounts to this: that just as the algebra attained 
greater richness and completeness by the use of two sorts of 
variables x and u, contragredient to one another in a field of order 
$n$, so also geometry in any number of dimensions (say $n - 1$
dimensions as equivalent to a field of order $n$), becomes more
intelligible by the use of two sorts of elements: points and primes.
A prime is a space of $(n - 2)$ dimensions relative to the field of
order $n$. Prime is a useful word because it covers various cases:
thus a prime is a line in a plane field (when $n = 3$); it is a plane
in threefold space (when $n = 4$); and so on. Then it is found that
every geometrical property of points in the field can be matched
by a corresponding property of primes; and such are called dual
or reciprocal properties. At present it is enough to notice that
there are two kinds of reciprocal properties, the one arising from
the very fundamental texture of space and the second arising
from a ground form $F$, now interpreted as a geometrical locus,
such as a conic, for which pole and polar elements exist.

To illustrate these remarks consider the ternary case. A
triangle $ABC$ may be thought of as a set of three points $A$, $B$, $C$
or a set of three lines $BC$, $CA$, $AB$. In this we have an example
of the first kind of duality. As further instances of such properties
we can set, side by side, the facts:

Two sides $a$, $b$ of the triangle $ABC$ pass through one point $C$.
Two points $A$, $B$ of the triangle $abc$ lie on one line $c$.

But we obtain similar dual results by taking a conic $F$ in
the plane of a triangle $ABC$ and forming the polars of the points
$A$, $B$, $C$ with regard to the conic. This usually gives a new triangle,
say $a'b'c'$, where $a'$ is polar of $A$, $b'$ of $B$, and $c'$ of $C$. It is now
quite easy to write down dual properties, the one holding for
the triangle $ABC$, and the other for $a'b'c'$.


EXAMPLES 


1. The binary form $(a_0, a_1, \dots, a_p)(x_1, x_2)^p$, equated to zero, represents
p points on a line, points which may be real, coincident or complex. Each
root for $x_1 : x_2$ gives one such point P, and $x_1 : x_2$ may be interpreted as a
ratio determining the position of P relative to two fixed base points $B_1$,
$B_2$ of the line, in the familiar elementary way.


2. Non-homogeneously, if $x_1 = x$, $x_2 = 1$, we interpret this binary
p-ic by the use of a Cartesian co-ordinate x, relative to a given origin O
on the line.


3. Four binary linear forms $a_x$, $b_x$, $c_x$, $d_x$ have an absolute invariant

This is called the cross ratio or anharmonic ratio of the four forms. By 
interchanging a, b, c, d in all 24 ways, derive six cross ratios.

Ans. $k,\ 1 - k,\ \dfrac{1}{k},\ \dfrac{1}{1 - k},\ 1 - \dfrac{1}{k},\ \dfrac{k}{k - 1}.$

These follow by using the identity $(bc)(ad) + (ca)(bd) + (ab)(cd) = 0$.

4. Prove $\{abcd\} = \{cdab\} = \{dcba\} = \{badc\}$.

5. Prove that the operations of deriving the remaining five from any
one of the above six cross ratios form a group.

6. Prove that $\{abcd\}$ denotes the geometrical cross ratio of four points
in a line. Here $d_x = 0$ gives the point $(d_2, -d_1)$, in homogeneous
co-ordinates.

7. Examine the special cases when (i) $k = 0, 1, \infty$; (ii) $k = -1, 2$ or $\tfrac12$.

[(i) Two points coincide. (ii) The range is harmonic.]




8. If the roots of the quadratics $a_x^2 = 0$, $b_x^2 = 0$ denote two pairs of
points, prove that they separate each other harmonically if $(ab)^2 = 0$.

Non-symbolically, if $a_0 b_2 - 2 a_1 b_1 + a_2 b_0 = 0$, then $a_0 x^2 + 2 a_1 x + a_2$
and $b_0 x^2 + 2 b_1 x + b_2$ determine harmonic pairs of points.

9. The Jacobian $(ab) a_x b_x = 0$ determines two points K, L which are a
harmonic pair, simultaneously for the pairs P, Q and R, S, given respectively
by $a_x^2 = 0$, $b_x^2 = 0$.

10. If $(bc)(ca)(ab) = 0$ (§4, p. 218) the quadratics $a_x^2$, $b_x^2$, $c_x^2$ are each
harmonic to a common quadratic.

11. All quadratics of a pencil $\lambda f + \lambda' f'$ have a common harmonic
quadratic.

Ans. The Jacobian $(f, f')$.
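
As a numerical check on Examples 3 to 7, the following Python sketch forms a cross ratio from bracket factors and compares the values obtained under all 24 interchanges with the six values of the answer. The convention k = (ac)(bd) / (ad)(bc), the four sample forms and the helper names are assumptions made here for illustration only; the text's own defining formula may differ, though any of the usual conventions leads to the same set of six values.

```python
from fractions import Fraction
from itertools import permutations


def bracket(p, q):
    """Bracket factor (pq) = p1*q2 - p2*q1 of two binary linear forms."""
    return p[0] * q[1] - p[1] * q[0]


def cross_ratio(a, b, c, d):
    """Assumed convention: (ac)(bd) / ((ad)(bc))."""
    return Fraction(bracket(a, c) * bracket(b, d),
                    bracket(a, d) * bracket(b, c))


def six_values(k):
    """The six values listed in the answer to Example 3."""
    return {k, 1 - k, 1 / k, 1 / (1 - k), 1 - 1 / k, k / (k - 1)}


# Four sample linear forms, given by their coefficient pairs (illustrative).
forms = [(1, 1), (1, 2), (1, 3), (1, 5)]
a, b, c, d = forms

k = cross_ratio(a, b, c, d)
orbit = {cross_ratio(*p) for p in permutations(forms)}

# The identity (bc)(ad) + (ca)(bd) + (ab)(cd) = 0 quoted after the answer.
identity = (bracket(b, c) * bracket(a, d)
            + bracket(c, a) * bracket(b, d)
            + bracket(a, b) * bracket(c, d))

print(k, sorted(orbit), identity)
```

For these sample forms the 24 interchanges give only six distinct values, and the bracket identity vanishes as stated.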

3. Further Binary Results.¹

A second interpretation of binary forms, dual to Example 1
above, is to treat the variables $x_1$, $x_2$ as ordinary Cartesian
co-ordinates and the binary n-ic as representing n straight lines
through the origin. In this case two quadratics represent two 
pairs of lines through a fixed point, and the vanishing of their 
simultaneous invariant now gives the necessary and sufficient 
condition for these pairs to form a harmonic pencil. 

EXAMPLE 

Prove that such pairs of lines meet any arbitrary line, which does not
contain the fixed point O, in a harmonic range.

A third interpretation of binary forms is to treat the n-ic
as representative of n points on a rational plane curve, by taking
the variable $x_1 : x_2$ as the parameter of a point of the curve.
For example, if (X, Y) are Cartesian co-ordinates, then the
relations

$$X : Y : 1 = x_1^2 : x_1 x_2 : x_2^2$$

determine the conic $X = Y^2$, and a binary n-ic in $x_1$, $x_2$, equated
to zero, gives n points on the conic. If ternary homogeneous
co-ordinates X, Y, Z are used, the conic $XZ = Y^2$ has the
parametric equations

$$X : Y : Z = x_1^2 : x_1 x_2 : x_2^2. \qquad (1)$$

It can be proved that projective properties of points on the 
conic correspond to binary invariants and covariants. 

¹ For further treatment, the reader should consult Grace and Young, Algebra
of Invariants, Chap. X.





A fourth interpretation of binary forms is the dual of the
third. The n-ic represents n tangents to a rational curve. Thus
if $UX + VY + 1 = 0$ is the equation of a line in Cartesian co-ordinates,
U, V are called line- or tangential co-ordinates. Then
if

$$U : V : 1 = x_1^2 : x_1 x_2 : x_2^2,$$

the line touches a conic whose tangential equation is $U = V^2$,
and whose point equation is $4X - Y^2 = 0$. Similarly for homogeneous
line co-ordinates.


EXAMPLES 

1. If

$$X : Y : Z = a x_1^2 + 2h x_1 x_2 + b x_2^2 \ :\ a' x_1^2 + 2h' x_1 x_2 + b' x_2^2 \ :\ a'' x_1^2 + 2h'' x_1 x_2 + b'' x_2^2,$$

show that a ternary linear transformation X, Y, Z → X', Y', Z' in general
exists such that $X' : Y' : Z' = x_1^2 : x_1 x_2 : x_2^2$.

Show that both points (X, Y, Z) and (X', Y', Z') lie on conics, for all
values of $x_1$, $x_2$.

2. If $X : Y : Z = a_x^n : b_x^n : c_x^n$, then (X, Y, Z) lies on a plane curve
of order n, which is rational.

[The line $UX + VY + WZ = 0$ cuts the curve in n points. Rational,
because this parametric form is rational.]

A fifth interpretation is to consider the n roots of a binary 
n-ic to be points in the Gauss plane. This method has the advan- 
tage of giving a real geometrical figure for complex binary forms. 

Lastly a sixth interpretation, and probably the most profound, 
is by means of the norm curve in space of n dimensions of which 
the plane conic (1) illustrates the quadratic case. By this is 
meant: taking

$$X_1 : X_2 : \dots : X_{n+1} = x_1^n : x_1^{n-1} x_2 : \dots : x_2^n, \qquad (2)$$

where the (n + 1) co-ordinates $X_1, X_2, \dots, X_{n+1}$ are called the
homogeneous co-ordinates of a point in n-fold space. We gain
a hint of its possibilities by noticing that, if n = 3, a point
$(X_1, X_2, X_3, X_4)$ of ordinary three-fold space lies on a curve
which meets an arbitrary plane

$$A_1 X_1 + A_2 X_2 + A_3 X_3 + A_4 X_4 = 0$$


in three points, if its co-ordinates satisfy (2). For the ratio
$x_1 : x_2$ then can only take three values, which are the roots of
a binary cubic

$$A_1 x_1^3 + A_2 x_1^2 x_2 + A_3 x_1 x_2^2 + A_4 x_2^3 = 0.$$

Such a curve is called a twisted cubic. If we think of this norm 
curve as fixed in space, then each binary cubic is associated with 
a plane in space. 

For example, if a binary cubic has a repeated factor its plane
touches the cubic curve; if it is a perfect cube, its plane osculates 
the curve. 
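
The association between planes and binary cubics can be tried numerically. The Python sketch below takes the norm curve (2) with n = 3 and builds the plane whose cubic has three prescribed roots; the function names and the sample roots are assumptions chosen for illustration, not notation from the text.

```python
def curve_point(x1, x2):
    """Point of the norm curve (2) for n = 3: (x1^3, x1^2 x2, x1 x2^2, x2^3)."""
    return (x1**3, x1**2 * x2, x1 * x2**2, x2**3)


def plane_through(r1, r2, r3):
    """Coefficients A1..A4 of the cubic (x1 - r1 x2)(x1 - r2 x2)(x1 - r3 x2)."""
    e1 = r1 + r2 + r3
    e2 = r1 * r2 + r2 * r3 + r3 * r1
    e3 = r1 * r2 * r3
    return (1, -e1, e2, -e3)


def incidence(plane, point):
    """Value of A1 X1 + A2 X2 + A3 X3 + A4 X4; zero when the point is on the plane."""
    return sum(a * x for a, x in zip(plane, point))


# A plane meeting the twisted cubic at parameters 1, 2, 4 (and not at 3).
plane = plane_through(1, 2, 4)
values = [incidence(plane, curve_point(t, 1)) for t in (1, 2, 4, 3)]

# Repeated factor: the plane through parameters 2, 2, 5 also contains the
# tangent direction d/dt (t^3, t^2, t, 1) = (3t^2, 2t, 1, 0) at t = 2.
touching = plane_through(2, 2, 5)
tangent_direction = (3 * 2**2, 2 * 2, 1, 0)
contact = (incidence(touching, curve_point(2, 1)),
           incidence(touching, tangent_direction))

print(values, contact)
```

The first three values vanish and the fourth does not, so the plane meets the curve at exactly the three chosen parameters; in the repeated-root case both the point and the tangent direction lie in the plane, which is the sense in which the plane touches the curve.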

4. Connexion of Binary with Higher Fields. 

If, for ternary forms, y and z are two distinct points
$\{y_1, y_2, y_3\}$, $\{z_1, z_2, z_3\}$, then

$$\{\xi_1 y_1 + \xi_2 z_1,\ \xi_1 y_2 + \xi_2 z_2,\ \xi_1 y_3 + \xi_2 z_3\}$$

are the co-ordinates of any point x in the line yz.
Further, if $u_x = 0$ is the equation of an arbitrary line, then the
point x lies on it if

$$u_1(\xi_1 y_1 + \xi_2 z_1) + u_2(\xi_1 y_2 + \xi_2 z_2) + u_3(\xi_1 y_3 + \xi_2 z_3) = 0, \qquad (3)$$

which can be written shortly as

$$\xi_1 u_y + \xi_2 u_z = 0. \qquad (4)$$

Similarly if $a_x^p = 0$ denotes a curve of order p, the line yz cuts
it at points x, for which $\xi_1 : \xi_2$ are given by the binary p-ic

$$(\xi_1 a_y + \xi_2 a_z)^p = 0,$$

i.e.

$$a_y^p\, \xi_1^p + p\, a_y^{p-1} a_z\, \xi_1^{p-1} \xi_2 + \dots + a_z^p\, \xi_2^p = 0. \qquad (5)$$

It is essential to notice that formulae (4) and (5) are precisely
the same if we start with two points y, z in space of any dimension;
for the extra terms in (3) are implied by the inner products
$u_y$, $a_y$, &c., of (4) and (5).

The theory of tangents and polars of conics, quadrics, and 
loci of higher orders can be readily deduced from ( 5 ) by the 
ordinary elementary methods. 


EXAMPLES 

1. The equation of the tangent to a conic $a_x^2 = 0$ at the point y on the
curve is $a_x a_y = 0$.

[Put p = 2, $a_y^2 = 0$ in (5), and make the two roots for $\xi_1 : \xi_2$ equal.]

2. The polar of y is $a_x a_y = 0$.





3. The tangent prime at the point y to the quadric $a_x^2 = 0$ in general
is $a_x a_y = 0$. This equation also gives the polar of y.

4. If $a_y a_z = 0$ the points y, z are conjugate. Prove that the line yz
cuts the conic (or quadric) in two points harmonically separating y, z.

5. The line yz touches $a_x^2 = 0$ if $a_y^2 b_z^2 - a_y a_z b_y b_z$ vanishes, this
being the discriminant for equal roots $\xi_1 : \xi_2$. The symbols a, b are
equivalent.

This reduces to $\tfrac12 (ab \mid yz)^2 = 0$.

6. In ternary forms if u = yz, then $u_x = (xyz)$; and $u_x = 0$ or
$(xyz) = 0$ is the equation of the line yz. Prove that this line u touches the
conic $f = a_x^2 = b_x^2 = 0$, if $(abu)^2 = 0$.


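7. If $f = \sum_{i,j=1}^{3} a_{ij} x_i x_j$, $a_{ij} = a_{ji}$, then

$$\begin{vmatrix} 0 & u_1 & u_2 & u_3 \\ u_1 & a_{11} & a_{12} & a_{13} \\ u_2 & a_{21} & a_{22} & a_{23} \\ u_3 & a_{31} & a_{32} & a_{33} \end{vmatrix} = -\tfrac12 (abu)^2.$$


8. Write out the dual statement of this §4 and those examples. In
particular, if two lines u, v are conjugate for the conic $(abu)^2 = u_\alpha^2 = 0$,
then $u_\alpha v_\alpha = 0$.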
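
The relation of Ex. 7 between the bordered determinant and the symbolic square (abu)² can be checked on a numerical conic. In the Python sketch below the symbolic product is evaluated by expanding both brackets and replacing each pair of symbol components by the corresponding coefficient of the conic; the sample conic, the sample line and all function names are assumptions made for illustration.

```python
from itertools import permutations


def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def sign(perm):
    """Sign of a permutation of 0..n-1, found by counting inversions."""
    s = 1
    for i in range(len(perm)):
        for j in range(i + 1, len(perm)):
            if perm[i] > perm[j]:
                s = -s
    return s


def abu_squared(a, u):
    """Non-symbolic value of (abu)^2: a_i a_l -> a[i][l], b_j b_m -> a[j][m]."""
    total = 0
    for p in permutations(range(3)):
        for q in permutations(range(3)):
            i, j, k = p
            l, m, n = q
            total += sign(p) * sign(q) * a[i][l] * a[j][m] * u[k] * u[n]
    return total


def bordered(a, u):
    """The conic's matrix bordered by the line co-ordinates, zero in the corner."""
    return det([[0] + list(u)] + [[u[i]] + list(a[i]) for i in range(3)])


conic = [[2, 1, 0], [1, 3, -1], [0, -1, 4]]   # symmetric sample coefficients
line = [1, -2, 3]                             # sample line co-ordinates u

lhs = bordered(conic, line)
rhs = abu_squared(conic, line)
print(lhs, rhs)
```

For the unit conic with matrix equal to the identity the same functions give −(u₁² + u₂² + u₃²) and 2(u₁² + u₂² + u₃²), which shows the factor −½ directly.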


5. The Clebsch Transference Principle. Extensionals. 

We have met with the elegant theory of extensionals 
(Corollary III, p. 49) wherein a general property of n-rowed 
determinants leads to analogous properties of determinants of 
higher order. This conception influences the symbolic invariant 
theory, particularly in exhibiting the actual working of projection 
from one space to another. As a rule the methods of algebraic 
and pure geometry are alien to each other, only having some- 
thing in common at the beginning and end of a chain of reasoning. 
But here they are in close touch, and furnish one of the beautiful 
harmonies of mathematics. The principle is due to Clebsch. 

An illustration leading to this principle is given by the work
of the preceding section. In fact we can look on the expression
$(\xi_1 a_y + \xi_2 a_z)^p$ of (5) as a symbolic binary form in
variables $\xi_1$, $\xi_2$ if we treat $a_y$, $a_z$ as binary symbols $A_1$, $A_2$ by
taking

$$A_1 = a_y, \quad A_2 = a_z, \quad \text{so that} \quad A_\xi = a_x. \qquad (6)$$

Thus in Ex. 5 above, the original quadric in x is now $A_\xi^2$;
and if equivalent symbols B are used, the condition for the
line yz to touch the quadric now reads as

$$\tfrac12 (AB)^2 = \tfrac12 (ab \mid yz)^2 = 0. \qquad (7)$$




This shows that from a certain binary invariant (usually
symbolized by $(ab)^2$ and here by $(AB)^2$) we deduce a concomitant
involving an arbitrary line yz in higher dimensions, such that
the points common to a quadric and the line coincide if the
binary invariant vanishes.

By the principle of duality we can also write

$$\tfrac12 (AB)^2 = \tfrac12 (abuv \dots w)^2$$

in terms of (n − 2) primes u, v, ..., w which suffice to determine
the same line yz.

This instance easily leads to the Clebsch transference principle,
which is:

If

$$I = f\{(ab), \dots\}$$

is the symbolic form of an invariant of binary ground forms $a_x^p$,
$b_x^q$, ..., then the corresponding expression

$$f\{(ab \mid xy), \dots\},$$

where each bracket factor has been replaced by a second compound
always containing the same x, y, has the same geometrical significance
for the points common to loci $a_x^p = 0$, $b_x^q = 0$, ..., and an arbitrary
line xy in (n − 1)-fold space, that the invariant I has for points
on the line illustrating a binary field.

Proof.

With the notation of this section, if the geometrical
property of the p + q + ... points on a line is given by
$f\{(ab), \dots\} = 0$, then that of the corresponding points on the
line xy is given by $f\{(AB), \dots\} = 0$.

But

$$(AB) = a_x b_y - a_y b_x = (ab \mid xy). \qquad (8)$$

Hence $f\{(ab \mid xy), \dots\} = 0$,

which proves the theorem.
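
The step (8) on which the proof rests is an identity between a two-rowed determinant of inner products and a second compound. The Python sketch below checks it in a quaternary field, treating a and b as ordinary linear forms with numerical coefficients rather than as symbols; that simplification, the sample numbers and the function names are assumptions made for illustration.

```python
from itertools import combinations


def inner(a, x):
    """Inner product a_x = a1 x1 + a2 x2 + ... + an xn."""
    return sum(ai * xi for ai, xi in zip(a, x))


def second_compound(a, b, x, y):
    """(ab | xy): sum over i < j of (a_i b_j - a_j b_i)(x_i y_j - x_j y_i)."""
    return sum((a[i] * b[j] - a[j] * b[i]) * (x[i] * y[j] - x[j] * y[i])
               for i, j in combinations(range(len(a)), 2))


# Two sample forms and two sample points in a field of order 4.
a, b = (1, 2, -1, 3), (0, 1, 4, -2)
x, y = (2, -1, 1, 0), (1, 1, 0, 5)

# Binary symbols on the line xy: A1 = a_x, A2 = a_y, and likewise for B.
A = (inner(a, x), inner(a, y))
B = (inner(b, x), inner(b, y))

binary_bracket = A[0] * B[1] - A[1] * B[0]      # (AB)
compound = second_compound(a, b, x, y)          # (ab | xy)
print(binary_bracket, compound)
```

Both numbers agree, so any polynomial in the binary brackets (AB) takes the same value when each bracket is rewritten as the compound (ab | xy).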

Covariants. More generally a binary covariant involving
a variable x is transferred by this principle simply by altering
bracket factors, as in (7), and leaving inner products unchanged.
This procedure is implied in conditions (6) above.

For example, the binary covariant equation

$$(ab)\, a_x b_x = 0$$

gives the point pair at once harmonic to $a_x^2$ and $b_x^2$. Hence on
a line xy, the corresponding points are given by

$$(AB)\, A_\xi B_\xi = 0,$$

or

$$(ab \mid xy)\, a_X b_X = 0,$$

or

$$(abu \dots t)\, a_x b_x = 0,$$

replacing X by x, provided x denotes the variable point on the
line u ... t cut by the loci $a_x^2$, $b_x^2$. This concomitant of course
also has a significance for points x in space, which are not on
the line; but in that case it has no direct binary relation.

Transference in General.

An invariant property of a lower can always be transferred to
a higher field by this extension of symbolic bracket factors.

For we merely have to replace a bracket factor $(a_1 a_2 \dots a_r)$
of a field of order r (< n) by an rth compound

$$(a_1 a_2 \dots a_r \mid x_1 x_2 \dots x_r),$$

or its equivalent outer product

$$(a_1 a_2 \dots a_r\, u_1 u_2 \dots u_{n-r}),$$

and interpret the result in a field of order n. The reader will
have no difficulty in supplying a formal proof by the methods
of §4 (cf. p. 184, (8)), by starting with r distinct points
$x_1, \dots, x_r$, and r parameters $\xi_1, \dots, \xi_r$, in place of the
previous y, z, $\xi_1$, $\xi_2$.

EXAMPLES 

1. The ternary condition $(abu)\, a_x b_x = 0$ gives the points x on a line
u which are conjugates with regard to two conics $a_x^2$, $b_x^2$.

2. If $(ab \mid xy)\, a_x b_x = (abuv)\, a_x b_x = (abp)\, a_x b_x$ vanishes, the points x on
the line p are conjugates for each of two quadric surfaces $a_x^2$, $b_x^2$.

3. If $(bcu)(cau)(abu) = 0$, a certain trio of conics is cut in involution
by the line u. (§4, p. 218.)

4. Interpret $(bcp)(cap)(abp) = 0$ for a line p in threefold space.

5. The envelope of a line which cuts two conics harmonically is a conic.

[For $(abu)^2 = 0$ is quadratic in the line co-ordinates $u_1$, $u_2$, $u_3$.]

6. The totality of lines which cut two quadric surfaces harmonically is
the Battaglini quadratic complex $(abp)^2 = 0$.

We write $(abp)^2 = (abuv)^2 = (ab \mid xy)^2$ where x, y are two points on
the line and u, v two planes through the line. If y is fixed, x describes
a quadric surface. Dually, if v is fixed, the plane u envelops a quadric
surface.

6. Projective Properties. 

The Clebsch principle gives an instantaneous proof that the
vanishing of an invariant of ground form is indeed the algebraic
interpretation of a projective property (§2, p. 271). For simplicity
let us consider projection, in three dimensions, of a figure
in a plane u to a plane v from a vertex θ.

Then we call the point x of plane u the projection of x' in
the plane v, if points x, x', θ are in line. It is essential that u
and v should be distinct planes, neither passing through the
point θ.

Now consider a quaternary invariant of any number of points
θ, x, y, z, t, .... Let it be

$$f\{(xyzt), (xyz\theta), \dots\}.$$

If all the points except θ are in the plane u, then (xyzt) vanishes
identically, because every four coplanar points are linearly related.
Hence the function is entirely composed of factors, each including
θ, which we now write $f\{(xyz\theta), \dots\}$.

But for any other point x' in the line θx we can take

$$x' = x + \lambda\theta$$

(λ scalar). Hence, if also $y' = y + \mu\theta$, $z' = z + \nu\theta$, then

$$(x'y'z'\theta) = (x + \lambda\theta,\ y + \mu\theta,\ z + \nu\theta,\ \theta) = (xyz\theta),$$

since all other terms involve two or more θ's in the expansion of
the bracket factor, and consequently vanish. Thus

$$f\{(xyz\theta), \dots\} = f\{(x'y'z'\theta), \dots\},$$

showing that actual projection maintains the invariant property.

Further, if we suppress θ in each factor, the invariant
$f\{(xyz), \dots\}$ now exhibits a ternary property of points in the
plane u. It follows that every ternary invariant equated to zero
specifies a projective property. Such an argument is general,
true for all dimensions, for any number of successive projections;
and indeed can be extended to include symbols as well as points
among the bracket factors.
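
The invariance of the bracket factor under projection from θ is easily confirmed with numbers. In the Python sketch below the points, the scalars λ, μ, ν and the function names are illustrative assumptions; the bracket (xyzθ) is taken to be the four-rowed determinant of the co-ordinates.

```python
def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def shifted(point, scalar, vertex):
    """The point x' = x + scalar * theta on the line joining x to the vertex."""
    return [p + scalar * t for p, t in zip(point, vertex)]


theta = [1, 2, 0, 1]                                   # vertex of projection
x, y, z = [1, 0, 0, 2], [0, 1, 0, -1], [3, 1, 1, 0]    # sample points

# Projected points, with lambda = 5, mu = -3, nu = 7 chosen arbitrarily.
x2, y2, z2 = shifted(x, 5, theta), shifted(y, -3, theta), shifted(z, 7, theta)

before = det([x, y, z, theta])      # (x y z theta)
after = det([x2, y2, z2, theta])    # (x' y' z' theta)
print(before, after)
```

The two brackets are equal, whatever scalars are used, because every extra term in the expansion contains θ in two rows.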





7. First Geometrical Interpretation of Linear Transformation.
Collineation.

Let M be the non-singular square matrix of n rows, and x
a single-column matrix representing a point P in (n − 1)-fold
space. Then the product Mx is also a single column, which denotes
a point Q. We write

$$M = [e_{ij}], \quad |M| = |e_{ij}| \neq 0, \quad x = \{x_j\}, \quad \xi = \{\xi_j\},$$

$$Mx = \xi, \quad M^{-1}\xi = x.$$

We use homogeneous co-ordinates, so that, if $\rho \neq 0$,
$\{\rho x_1, \dots, \rho x_n\}$ represents the same point P. If x is given,
a unique set ξ is found, and if ξ is given, x also is unique.
Also since

$$\rho M x = M \rho x$$

when ρ is scalar, the point P given by the set $\{\rho x_1, \dots, \rho x_n\}$
in this way is connected with the point Q, $\{\rho \xi_1, \dots, \rho \xi_n\}$ by
what is called a one-one correspondence. Given either point
P, Q the other is completely determined.

Again if the matrix M is replaced by any other, except a
mere multiple ρM of itself, a new point Q is derived from the
same point P. So the correspondence between P and Q is
specified by the matrix.

Geometrically, when a one-one correspondence connects points
P of a given field with points Q of a second given field (which may
coincide with the field of P, as in the present case), the correspondence
is called a collineation. Thus:

For a given frame of reference a non-singular matrix M of
order n determines a collineation for points of the field.

Or again,

A linear transformation x → ξ determines a collineation
between points P(x) and points Q(ξ).


EXAMPLES 

1. The equations

$$\xi_1 = l_1 x_1 + m_1 x_2 + n_1 x_3,$$
$$\xi_2 = l_2 x_1 + m_2 x_2 + n_2 x_3, \qquad |l_1 m_2 n_3| \neq 0,$$
$$\xi_3 = l_3 x_1 + m_3 x_2 + n_3 x_3,$$

determine a collineation between the point $P(x_1, x_2, x_3)$ and the point
$Q(\xi_1, \xi_2, \xi_3)$ of a plane.

2. If P lies on a given line, Q also lies on a given line.

3. If P describes a curve of order n, so does Q.

4. In ordinary space, if P lies on a given line, so does Q; if P lies on a
given plane, so does Q.

5. Generalize this set of results.

6. If $P_1 P_2 P_3 P_4$ are four points in line, the cross ratio $\{P_1 P_2 P_3 P_4\}$ is
the same as that of the corresponding points $\{Q_1 Q_2 Q_3 Q_4\}$.

7. An involution on a straight line (§4, p. 218) is a symmetrical collineation.
Thus if, on a line A, P corresponds to Q then when P is situated
at Q, Q becomes P.

8. The general definition of involution in space is symmetry of collineation.
Prove that the necessary and sufficient condition for an
involution is $M^2 = \rho I$, where ρ is any non-zero scalar.

[$M^{-1}$ is a scalar multiple of M.]

8. Latent Points of a Transformation.

For certain positions of $P(x_1, x_2, \dots, x_n)$ the corresponding
points P and Q will coincide. These are called latent points.

In general there will be n distinct latent points; for if P and
Q coincide, we have for some value λ,

$$e_{r1} x_1 + e_{r2} x_2 + \dots + e_{rn} x_n = \lambda x_r, \quad r = 1, 2, \dots, n. \qquad (9)$$

Written in full this gives n linear equations from which, by
eliminating $x_1, x_2, \dots, x_n$, we have the so-called characteristic
equation (p. 98) of the matrix:

$$f(\lambda) \equiv \begin{vmatrix} e_{11} - \lambda & e_{12} & \dots & e_{1n} \\ e_{21} & e_{22} - \lambda & \dots & e_{2n} \\ \dots & \dots & \dots & \dots \\ e_{n1} & e_{n2} & \dots & e_{nn} - \lambda \end{vmatrix} = 0. \qquad (10)$$

In the case when there are n distinct roots $\lambda_1, \lambda_2, \dots, \lambda_n$ of
this equation, which is a binary n-ic in λ, we determine one set
of values $x^{(i)}$ by solving (n − 1) of the equations (9) for each value
$\lambda_i$ of λ. Thus we should have $x^{(i)}$ given by a row of first minors
of the determinant $f(\lambda_i)$, and the n sets $x^{(i)}$ so found will be
distinct. For if not, let $x^{(i)} = x^{(j)}$. Then substituting $x^{(i)}$ and
$x^{(j)}$ in turn in (9) and subtracting, $(\lambda_i - \lambda_j)\, x_r^{(i)} = 0$, i.e. $\lambda_i = \lambda_j$,
which is contrary to the hypothesis. These n points are also
linearly independent and form a simplex (p. 296).
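
The rule that a latent point is given by a row of first minors of f(λ) can be tried on a small matrix. In the Python sketch below the sample matrix is triangular, so that its latent roots 2, 3, 6 can be read off the diagonal; the matrix and the function names are assumptions made for illustration, and signed minors (cofactors) are used so that the row satisfies equations (9).

```python
def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        sub = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(sub)
    return total


def minor(m, i, j):
    """Matrix m with row i and column j struck out."""
    return [row[:j] + row[j + 1:] for r, row in enumerate(m) if r != i]


def latent_point(m, lam):
    """First non-vanishing row of signed first minors of the matrix m - lam*I."""
    n = len(m)
    shifted = [[m[r][c] - (lam if r == c else 0) for c in range(n)]
               for r in range(n)]
    for i in range(n):
        row = [(-1) ** (i + j) * det(minor(shifted, i, j)) for j in range(n)]
        if any(row):
            return row
    raise ValueError("all first minors vanish for this root")


def apply(m, x):
    """The product Mx of a square matrix and a single column."""
    return [sum(e * xi for e, xi in zip(row, x)) for row in m]


M = [[2, 0, 0], [1, 3, 0], [4, 5, 6]]   # sample matrix with latent roots 2, 3, 6
roots = [2, 3, 6]
points = {lam: latent_point(M, lam) for lam in roots}

for lam, x in points.items():
    print(lam, x, apply(M, x))
```

Each computed point x satisfies Mx = λx for its own root, and the determinant of the three points does not vanish, so they form a triangle of reference.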




The particular simplex (p. 86) whose n primes are given by
the n equations $x_i = 0$ is called the frame of reference. If
n = 3, it is the familiar triangle of reference; if n = 4, the
tetrahedron of reference; and so on.

Example.

A collineation referred to its latent points as frame of reference takes
the form $\xi_1 = \lambda_1 x_1$, $\xi_2 = \lambda_2 x_2$, ..., $\xi_n = \lambda_n x_n$.

9. Second Geometrical Interpretation of Linear Transformation, 
Change of Frame of Reference. 

Instead of maintaining a fixed frame of reference and interpreting
a linear transformation as a collineation, we may consider
that the geometrical figure is fixed, but a change is made in the
frame of reference, exactly as was done in Ex. 1, p. 151.
Again it will be found that this illustrates the same algebra.

The three homogeneous point co-ordinates $x_i$ of that example
are replaced by n such co-ordinates; and the three line co-ordinates
$u_i$, by n prime co-ordinates $u_i$. Then $u_x = 0$ is the
point equation of the prime u (or dually is the prime, or tangential,
equation of the point x). Accordingly, we interpret
contragredient linear transformations x → x', u → u', for
which $u_x = u'_{x'}$, as giving the same point x and the same
prime u referred to a new frame.

Let the point P referred to one simplex have co-ordinates
$x = \{x_1, x_2, \dots, x_n\}$ and to another have co-ordinates

$$x' = \{x_1', x_2', \dots, x_n'\}.$$

Also let

$$x_r = c_{r1} x_1' + c_{r2} x_2' + \dots + c_{rn} x_n', \quad r = 1, 2, \dots, n, \qquad (11)$$

where the matrix $C = [c_{ij}]$ has rank n, so that $|C| \neq 0$; then
the n primes given by $x_r = 0$ have, for equations in terms of
$x_1', \dots, x_n'$,

$$c_{r1} x_1' + c_{r2} x_2' + \dots + c_{rn} x_n' = 0, \quad r = 1, 2, \dots, n.$$

These will be the equations of the primes of the simplex of reference
for x in terms of x'. And if we solve equations (11) for x' we
likewise obtain the primes of the second simplex referred to the
first.

It is well to have these two distinct interpretations of a linear 
transformation, for both have their value. 
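
The contragredient behaviour of point and prime co-ordinates under a change of frame can be illustrated numerically. In the Python sketch below the matrix C plays the part of the coefficients in (11); the particular numbers and the helper names are assumptions for illustration, and the prime co-ordinates are transformed by the transposed matrix so that the incidence u_x keeps its value.

```python
def mat_vec(m, v):
    """Product of a square matrix with a single column."""
    return [sum(e * vi for e, vi in zip(row, v)) for row in m]


def transpose(m):
    """Rows and columns interchanged."""
    return [list(col) for col in zip(*m)]


def inner(u, x):
    """The incidence form u_x = u1 x1 + ... + un xn."""
    return sum(ui * xi for ui, xi in zip(u, x))


C = [[1, 2, 0], [0, 1, 1], [1, 0, 3]]   # sample non-singular matrix, |C| = 5

x_new = [2, -1, 4]                      # co-ordinates x' in the new frame
x_old = mat_vec(C, x_new)               # equations (11): x = C x'

u_old = [3, 1, -2]                      # a prime in the old frame
u_new = mat_vec(transpose(C), u_old)    # contragredient change: u' = C^T u

print(inner(u_old, x_old), inner(u_new, x_new))

# The prime x_1 = 0 of the old frame has, in the new co-ordinates,
# the first row of C for its coefficients.
first_prime_new = C[0]
```

The two printed incidences agree, which is the statement that the same point and the same prime are merely being referred to a new frame.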



10. Reciprocation and Correlation. 


It has been remarked (§2, p. 283) that there is a second type
of duality besides the reciprocity generated by the texture of
space itself. In the second, a certain geometrical locus or manifold
or ground form Γ is required which gives rise to the reciprocity.
Let us confine our investigation to the case when Γ is given
algebraically by a matrix of a bilinear form Φ; and for shortness
let it be a ternary form, as typical of the general case.

We consider

$$\Gamma = \begin{bmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{bmatrix} = [a_{ij}], \qquad (12)$$

$$\begin{aligned} \Phi &= a_{11} x_1 y_1 + a_{12} x_1 y_2 + a_{13} x_1 y_3 \\ &\quad + a_{21} x_2 y_1 + a_{22} x_2 y_2 + a_{23} x_2 y_3 \\ &\quad + a_{31} x_3 y_1 + a_{32} x_3 y_2 + a_{33} x_3 y_3 \\ &= \sum a_{ij} x_i y_j = a_x b_y, \end{aligned} \qquad (13)$$

where $a_{ij} = a_i b_j$, symbolically.

Let x, y denote two points of which y is fixed. Then Φ = 0
gives the equation of a straight line, since it is linear in x. This
is called the polar line of the point y with regard to Φ, and
correlatively y is called the pole of this line. Thus if $u_x = 0$ is
the polar of y for this bilinear form, then on comparing coefficients
we may take

$$\begin{aligned} u_1 &= a_{11} y_1 + a_{12} y_2 + a_{13} y_3, \\ u_2 &= a_{21} y_1 + a_{22} y_2 + a_{23} y_3, \\ u_3 &= a_{31} y_1 + a_{32} y_2 + a_{33} y_3. \end{aligned} \qquad (14)$$

Further, if the determinant $|a_{ij}| \neq 0$, we have by solving
these equations,

$$\begin{aligned} y_1 &= a^{11} u_1 + a^{21} u_2 + a^{31} u_3, \\ y_2 &= a^{12} u_1 + a^{22} u_2 + a^{32} u_3, \\ y_3 &= a^{13} u_1 + a^{23} u_2 + a^{33} u_3, \end{aligned} \qquad (15)$$

where $a^{ij}$ is the typical element of the reciprocal $|a^{ij}|$ of the
determinant $|a_{ij}|$. Thus

$$\Gamma^{-1} = \begin{bmatrix} a^{11} & a^{21} & a^{31} \\ a^{12} & a^{22} & a^{32} \\ a^{13} & a^{23} & a^{33} \end{bmatrix}. \qquad (16)$$

The linearity of these conditions (14), (15) shows that a one-to-one
correspondence exists between pole and polar for the
ground form Γ. Every point of the plane has its polar, and every
polar has its pole.
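
The passage from pole to polar by (14), and back again by (15), can be followed with exact arithmetic. The Python sketch below uses a sample non-symmetric matrix for Γ and forms the reciprocal matrix from cofactors divided by the determinant; the numbers and function names are assumptions made for illustration.

```python
from fractions import Fraction


def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        sub = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(sub)
    return total


def minor(m, i, j):
    """Matrix m with row i and column j struck out."""
    return [row[:j] + row[j + 1:] for r, row in enumerate(m) if r != i]


def inverse(m):
    """Reciprocal matrix: entry (i, j) is the cofactor of m[j][i] over |m|."""
    d, n = det(m), len(m)
    return [[Fraction((-1) ** (i + j) * det(minor(m, j, i)), d)
             for j in range(n)] for i in range(n)]


def mat_vec(m, v):
    """Product of a square matrix with a single column."""
    return [sum(e * vi for e, vi in zip(row, v)) for row in m]


def bilinear(g, x, y):
    """The form sum of g[i][j] * x_i * y_j."""
    return sum(g[i][j] * x[i] * y[j]
               for i in range(len(x)) for j in range(len(y)))


gamma = [[1, 2, 0], [0, 1, 3], [2, 0, 1]]   # sample ground form, |gamma| = 13
y = [1, -1, 2]                              # the pole

u = mat_vec(gamma, y)                       # equations (14): polar line of y
pole = mat_vec(inverse(gamma), u)           # equations (15): back to the pole

x = [4, 0, -3]                              # any other point
print(u, pole, bilinear(gamma, x, y))
```

The recovered pole coincides with y, and for every point x the value of the bilinear form equals the incidence u_x, so that the form vanishes exactly when x lies on the polar line.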

11. Correlation.

An interesting algebraic feature has presented itself in the
system of equations (14), (15), connecting contragredient variables
y and u, for hitherto we have only dealt with cogredient transformations.
So we make the following definitions.

Definition of Correlation and Collineation. A linear transformation
connecting contragredient variables is a correlation, one
connecting cogredient variables is a collineation.

Transformations x → u, u → x are correlations, while
x → x', u → u' are collineations. We classify correlations as
symmetrical, skew symmetrical, and general.


EXAMPLES 

1. If Γ is a symmetrical correlation, then its matrix furnishes the
coefficients of a quadric, $\sum a_{ij} x_i x_j$, $a_{ij} = a_{ji}$. The correlation x → u is now
one aspect of polar reciprocation with regard to the quadric: it replaces
a point x by its polar.

The inverse correlation u → x replaces a polar by its pole.

2. If Γ is skew symmetrical, then $a_{ij} = -a_{ji}$. Symbolically
$a_x b_y = -a_y b_x = \tfrac12 (ab \mid xy)$.

Prove that every point lies on its polar. What is the inverse property?

When pole and polar are so incident the correlation is called a Null
System.

3. A quaternary linear complex $(abp) = \sum a_{ij} p_{ij}$ generates a Null
System. For if the line p = xy belongs to the complex then $\sum a_{ij} p_{ij} = 0$.
If y is fixed, x describes a plane whose equation is $(ab \mid xy) = 0$. This is
called the polar plane, and clearly y lies on it.

4. What is the dual of Ex. 3?

5. Can this be generalized?

6. Show that a Null System breaks down in space of even dimensions.

[If n is odd, the skew symmetric matrix Γ is singular.]
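
Examples 2 and 6 lend themselves to a direct numerical check. In the Python sketch below two sample skew symmetric matrices, of orders 4 and 3, stand for Γ; the entries and function names are assumptions chosen for illustration.

```python
def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        sub = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(sub)
    return total


def bilinear(g, x, y):
    """The form sum of g[i][j] * x_i * y_j."""
    return sum(g[i][j] * x[i] * y[j]
               for i in range(len(x)) for j in range(len(y)))


# Skew symmetric sample matrices: g[i][j] = -g[j][i], zero diagonal.
skew4 = [[0, 1, 2, 3],
         [-1, 0, 4, 5],
         [-2, -4, 0, 6],
         [-3, -5, -6, 0]]
skew3 = [[0, 1, 2],
         [-1, 0, 4],
         [-2, -4, 0]]

y = [1, 2, -1, 3]
on_own_polar = bilinear(skew4, y, y)   # incidence of y with its own polar plane

print(on_own_polar, det(skew4), det(skew3))
```

The point lies on its own polar plane in the quaternary case, where the matrix is non-singular; in the ternary case (a plane, of even dimension) the determinant vanishes and the correspondence between pole and polar is no longer one-to-one.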

12. Canonical Form of a Matrix. 

A very interesting application of the twofold geometrical 
interpretation of a linear transformation is provided by the 
following theorem. 




A matrix M all of whose latent roots $\lambda_1, \lambda_2, \dots, \lambda_n$ are distinct
from each other and from zero can be expressed as a product $ALA^{-1}$,
where A is non-singular and L is a diagonal matrix consisting
of $\lambda_1, \lambda_2, \dots, \lambda_n$.

Proof.

For let M denote the collineation changing a point x to ξ,
so that $x = M\xi$. Since M has n distinct latent roots it has a
simplex of n latent points. For, if there were a relation

$$\rho_1 x^{(1)} + \rho_2 x^{(2)} + \dots + \rho_k x^{(k)} = 0, \qquad (17)$$

with none of the ρ's zero, for k of the latent points
$x^{(1)}, \dots, x^{(k)}$ ($k \le n$), then, as in the special case of §8, we
would have $\sum \rho_i \lambda_i x^{(i)} = 0$. From this and (17) one $x^{(i)}$ could
be eliminated, giving a relation for k − 1 points, then similarly
for k − 2 points, and finally for one point, which is absurd.
The k points are therefore linearly independent.

Let the change of frame from the original to this new
simplex be given by $x = Ay$ and $\xi = A\eta$, so that the
cogredient sets x, ξ are now replaced by y, η.

Hence y → η is the collineation referred to its latent points
as frame. This gives $y_1 = \lambda_1 \eta_1$, $y_2 = \lambda_2 \eta_2$, ..., $y_n = \lambda_n \eta_n$
(Example of §8, p. 293), so that $y = L\eta$ where L is the diagonal
matrix of $\lambda_1, \lambda_2, \dots, \lambda_n$.

But by elimination of η, y we have

$$M\xi = x = Ay = AL\eta = ALA^{-1}\xi,$$

true for every point ξ. Hence $M = ALA^{-1}$.

The corresponding theorem¹ when latent roots are zero or
coincident is true, provided L is suitably modified.
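
The theorem can be verified on a small case without inverting any matrix, since M = ALA⁻¹ is the same statement as MA = AL when A is non-singular. In the Python sketch below the sample matrix, its latent roots 2, 3, 6 and the latent points used as the columns of A are assumptions chosen for illustration; each latent point was obtained beforehand by solving Mx = λx.

```python
def mat_mul(p, q):
    """Product of two square matrices given as lists of rows."""
    return [[sum(p[i][k] * q[k][j] for k in range(len(q)))
             for j in range(len(q[0]))] for i in range(len(p))]


def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        sub = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(sub)
    return total


M = [[2, 0, 0], [1, 3, 0], [4, 5, 6]]

# Latent roots with one latent point each (illustrative values).
latent = {2: [4, -4, 1], 3: [0, -3, 5], 6: [0, 0, 17]}
roots = sorted(latent)

# A has the latent points as its columns; L is the diagonal matrix of roots.
A = [[latent[lam][r] for lam in roots] for r in range(3)]
L = [[roots[i] if i == j else 0 for j in range(3)] for i in range(3)]

left = mat_mul(M, A)    # M A
right = mat_mul(A, L)   # A L
print(left, right, det(A))
```

The products MA and AL agree column by column, and since the determinant of A does not vanish the relation may be multiplied on the right by A⁻¹ to give the canonical form.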

EXAMPLES

1. By expanding the equation MA = AL in the above, for the case of
three-rowed square matrices, verify that $\lambda_1$, $\lambda_2$, $\lambda_3$ are latent roots of the
matrix M, and that there are nine linear equations, from which the
elements of M can actually be determined.

2. Prove the Cayley-Hamilton theorem by using the canonical form
of a matrix.

¹ Cullis, Matrices and Determinoids (Cambridge, 1926), III, p. 342. Dickson,
Modern Algebraic Theories (Chicago, 1926), Chap. V. Bôcher, Higher Algebra
(New York, 1919), Chap. XX.



CHAPTER XX 
The General Quadric 

1. Complete System of the General Quadric.

Let

$$f = a_x^2 = b_x^2 = \dots = m_x^2$$

be the symbolic form of the quadric $F = \sum a_{ij} x_i x_j$, homogeneous
in n variables $x_1, x_2, \dots, x_n$ (n > 2), where a, b, ..., m are n
equivalent symbols, defined by the identity

$$a_{ij} = a_i a_j = b_i b_j = \dots = m_i m_j,$$

true for all values of i and j from 1 to n inclusive.

We can prove that the symbolic expressions

$$A_1 = a_x^2, \quad A_2 = (ab \mid xy)^2, \quad A_3 = (abc \mid xyz)^2, \ \dots,$$

$$A_{n-1} = (ab \dots l \mid xy \dots)^2, \quad A_n = \Delta = (ab \dots lm)^2,$$

all of which are concomitants of the quadric, form a complete irreducible
system for a single quadric $f = A_1 = a_x^2$ and any number
of linear ground forms. In other words, every polynomial concomitant
of one quadric and any number of variables x, y, z, ...,
u, v, w, ... is expressible as a polynomial in $A_1, A_2, \dots, A_n$
and of polars of these forms.

Proof . — 

Let each variable x, y, ... be resolved into (n − 1) cogredient
variables of type u, so that any concomitant is now symbolized
by a polynomial in outer products only, such as

$$(abc \dots uvw \dots).$$

Here there are r equivalent quadric symbols convolved, together
with n − r variables u, v, ....


Now let a single-term symbolic product of ν factors, involved
in a concomitant, be

where the suffixes $r_1, r_2, \dots, r_\nu$ are arranged in descending order.
Somewhere in the product beyond the first factor will be found
the duplicate symbols of $A_{r_1}$. Either they are all in the second
factor or not. If they are, then $r_2 = r_1$, since $r_2$ cannot exceed $r_1$,
and the first two factors of P are

an actual polarized form of $A_{r_1}$ or $A_{r_1}$ itself. Then P is said to
be reducible; we remove this factor and deal with the residual
lower degree factor.

But if the second factor does not contain all the duplicates of $A_{r_1}$, we
transform P by the process of §8, p. 193, and convolve $A_{r_1}$ a second
time in this second factor at the expense of other symbols and
variables originally within the factor. Thus

where λ is numerical, and $B_i' V_i'$ is part of the original contents
of this factor. If i = 0, again a factor $A_{r_1}'$ emerges. If i > 0
we place this second factor first, with its increased currency
$r_1 + i$ of equivalent symbols, and proceed as before.

Since i cannot exceed $n - r_1$, the process of so raising the
currency is finite, and P is thereby expressed as reducible terms
containing factors $A_r$, r = 0, 1, ..., n. This proves the theorem.

Corollary I. — A single quadric has only one invariant — its 
discriminant. 

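Corollary II. The complete system for the dual form
$\Sigma = u_\alpha^2 = u_\beta^2 = \dots$, is

$$\Sigma, \quad (\alpha\beta \mid uv)^2, \quad (\alpha\beta\gamma \mid uvw)^2, \ \dots, \quad (\alpha\beta\gamma \dots \mu)^2.$$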

EXAMPLES 

1. The system for a ternary quadratic $a_x^2 = b_x^2 = c_x^2$ is $a_x^2$, $(abu)^2$,
$(abc)^2$. What is their geometrical significance?

2. A covariant conic exists for a conic and a single point.

Ans. $(ab \mid xy)^2$ is the covariant, if y is the given point, and x the
variable.

3. A contravariant conic exists for a conic, in tangential co-ordinates,
and a single line.

Ans. $(\alpha\beta \mid uc)^2$ where $u_\alpha^2 = u_\beta^2$ is the quadratic and $c_x$ the linear form.

4. If $c_x = 0$ is the line at infinity what does $(\alpha\beta \mid uc)^2 = 0$ represent?

[The line is parallel to an asymptote of the conic.]

5. The bordered determinants (pp. 103-105) give the non-symbolic form
of the irreducible concomitants $A_i$.

6. The equations of these concomitants $A_i$ are the respective conditions
that a line xy, plane xyz, ... should touch a quadric.

[Use the Clebsch transference principle.]

7. If Δ = 0 the quadric is a cone.

For if $a_x a_\xi = 0$, $b_y b_\xi = 0$, ... are n equations of rank
r = n − 1, where $(xyz \dots t) \neq 0$, they can be solved for ξ (§9, p. 195).
Also, by eliminating ξ, they give Δ = 0. Any point θ is given linearly by
n general points x, y, ..., whence $a_\theta a_\xi = 0$; in particular if θ = ξ,
$a_\xi^2 = 0$, so ξ is on the quadric. Then if θ is also on the quadric, so again
is ξ + λθ (§4, p. 286), and therefore a line ξθ is on the quadric. This
identifies ξ as vertex and ξθ as generator of a cone.

If the rank is n − 2, ξ lies on a line vertex: when n = 4 the cone is
now two planes. If the rank is n − 3, ξ lies on a plane. And so on.

8. Taking Δ = 0, r = n − 1 and the vertex ξ as {1, 0, 0, ..., 0}, in
variables $X_1, X_2, \dots, X_n$, prove that the quadric must be a function of
$X_2, \dots, X_n$ only.

9. If r = n — 2, prove that A = An-i — 0 identically. Taking the line 

vertex as ^”1 = ~ ^ prove the quadric is a function of Ag, . . . , Xn only. 

10. If $A_{n-1} = (abc \dots lu)^2 = 0$ but not identically, then the prime u
touches the quadric. This $A_{n-1}$ can also be written $u_\alpha^2$ where
α = abc ... l in the notation of (35), p. 47.

[Cf. p. 287, §4, Ex. 5, 6, 7.]

2. Self-conjugate Simplex. 

A triangle xyz is self-conjugate for a coplanar conic if x is
pole of yz, y of zx, and z of xy.

A tetrahedron is self-conjugate for a quadric surface if x is
pole of a plane yzt, y is pole of zxt, z of xyt, and t of xyz. This
property can be extended to the general case.

Let

$$\left. \begin{matrix} x, & y, & z, & \dots, & t \\ u, & v, & w, & \dots, & q \end{matrix} \right\} \qquad (1)$$

denote the n points and corresponding primes of a self-conjugate
simplex (p. 86), such that pole and polar are in a vertical column.
Then we have the relations (cf. Ex. 4, 8, p. 287)

$$a_x^2 \neq 0, \quad u_\alpha^2 \neq 0, \quad a_x a_y = 0 \qquad (2)$$

as typical of any of the points and primes of the table.





3. Canonical Form of the Quadric. 


We can derive an important theorem from this set of relations 
(2), whereby a general quadric is expressed as the sum of at most 
n squares of linear forms. 

For consider the identity 

(uvw . . . q)a^ = 3 = {avw {uaw . . . q)v^ + • * • + . . . a)qt. 

By utilizing the self-conjugate property of the simplex (1) we 
write 

{am . . . g') = {uaw ,,.q)^ay, &c. 

tW 

{um . . . q)a^ = a^ut + a^v^ + . • . + a^q^, 


which is true for all values of a. By squaring this identity we 
obtain 


{um . . . qfat^ = -f- . . . -f 


a result of fundamental importance. All the product terms on 
the right have disappeared because of the conjugate properties 
such as a^ay= 0. 

Now regarding $x, y, \ldots, t$, $u, \ldots, q$ as constant, and $\xi$ as
variable, we have thus expressed a general quadric

$$f = a_\xi^2$$

as the sum of $n$ squares

$$F = A_1 X_1^2 + A_2 X_2^2 + \ldots + A_n X_n^2,$$

where

$$\frac{A_1}{f(x)} = \frac{A_2}{f(y)} = \ldots = \frac{A_n}{f(t)} = \frac{1}{(uvw \ldots q)^2},$$

and where

$$X_1 = u_\xi, \quad X_2 = v_\xi, \quad \ldots, \quad X_n = q_\xi$$

are linear forms in the original variables.

This is called a normal or canonical form of the quadric.
It has a very simple matrix of coefficients, consisting of diagonal
elements $A_1, A_2, \ldots, A_n$ only. In making this reduction we had
all $(n - 1)$-fold space, except on the quadric itself, for the choice
of the point $x$; one less dimension for the choice of $y$; and so on.
Algebraically we sum this up by saying that the reduction to
canonical form is possible in $\infty^{\nu}$ ways, where

$$\nu = (n - 1) + (n - 2) + \ldots + 1 = \tfrac{1}{2}n(n - 1).$$
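The reduction can be tried numerically. The following is a minimal sketch, assuming a quadric given by a symmetric matrix `A` (an illustrative choice, not taken from the text). It builds a self-conjugate simplex by making each new vertex conjugate to those already chosen, which is a Gram-Schmidt process with respect to the polar form and is swapped in here in place of the symbolic argument. The names `polar` and `self_conjugate_simplex` are hypothetical.

```python
from fractions import Fraction

def polar(A, x, y):
    """Polar form a_x a_y = sum of a_ij x_i y_j for the quadric with matrix A."""
    n = len(A)
    return sum(A[i][j] * x[i] * y[j] for i in range(n) for j in range(n))

def self_conjugate_simplex(A):
    """Return n mutually conjugate points, none of them on the quadric.

    Assumption: starting from the unit points e_1, ..., e_n never produces
    a vertex on the quadric (true for the sample matrix below).
    """
    n = len(A)
    vertices = []
    for k in range(n):
        p = [Fraction(int(i == k)) for i in range(n)]
        for v in vertices:
            # Remove the part of p that is not conjugate to v.
            c = polar(A, p, v) / polar(A, v, v)
            p = [pi - c * vi for pi, vi in zip(p, v)]
        if polar(A, p, p) == 0:
            raise ValueError("vertex fell on the quadric; choose another start")
        vertices.append(p)
    return vertices

# Illustrative ternary quadric (a conic).
A = [[Fraction(v) for v in row] for row in [[2, 1, 0], [1, 3, 1], [0, 1, 4]]]
simplex = self_conjugate_simplex(A)

# Matrix of the quadric referred to the simplex: diagonal, entries f(x), f(y), f(t).
gram = [[polar(A, x, y) for y in simplex] for x in simplex]
```

Referred to this simplex the quadric has only square terms, and the diagonal entries of `gram` play the part of the canonical coefficients up to the normalising factor.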





EXAMPLES 

1. Use the dual identity 

{xyz.,. rf . . . + 

to reduce the tangential form of the general contravariant quadric also 
to the sum of n squares, 


2. If $U_1, U_2, \ldots, U_n$ are contragredient to $X_1, \ldots, X_n$ and in fact
denote the same self-conjugate simplex, then the dual form of $F$ is

$$\frac{U_1^2}{A_1} + \frac{U_2^2}{A_2} + \ldots + \frac{U_n^2}{A_n}.$$

This follows by direct calculation from the bordered determinant of F, 


3. Find canonical forms for all members of the complete system.

Each member $\Delta_i$ is a sum of squares of $i$th compound co-ordinates:
while $\Delta = A_1 A_2 \ldots A_n$.


4. If the rank of $[a_{ij}]$ is $r$, then $A_k = 0$, $k > r$, but $A_r$ does not vanish.
Prove that the canonical quadric is now the sum of $r$ squares.


4. Theory of Two Quadrics. 

Let

$$f = a_x^2 = b_x^2 = c_x^2 = \ldots, \qquad f' = r_x^2 = s_x^2 = t_x^2 = \ldots \qquad (3)$$

be the symbolic forms of two different quadrics

$$F = \sum a_{ij} x_i x_j, \qquad F' = \sum r_{ij} x_i x_j \qquad (4)$$

in $n$ variables. From these two we derive a new quadric $\lambda F + \lambda' F'$,
said to belong to the pencil of quadrics determined by $F$ and $F'$.
Geometrically, whatever is common to the quadric manifolds $F$
and $F'$ is common to each of the $\infty^1$ members of the pencil. For
example, two conics have four points in common, shared also
by the members of their pencil, when $n = 3$; two quadric surfaces
have a curve in common, when $n = 4$; and so on.

Now since the typical coefficient of the quadric $\lambda F + \lambda' F'$
is $\lambda a_{ij} + \lambda' r_{ij}$, the discriminant must be $|\lambda a_{ij} + \lambda' r_{ij}|$, which on
expansion is a binary $n$-ic in $\lambda : \lambda'$, say

$$\Delta\lambda^n + \Theta_1\lambda^{n-1}\lambda' + \ldots + \Theta_i\lambda^{n-i}\lambda'^i + \ldots + \Delta'\lambda'^n. \qquad (5)$$


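In ternary forms we generally write this as

$$\begin{vmatrix}
\lambda a_{11} + \lambda' r_{11} & \lambda a_{12} + \lambda' r_{12} & \lambda a_{13} + \lambda' r_{13} \\
\lambda a_{21} + \lambda' r_{21} & \lambda a_{22} + \lambda' r_{22} & \lambda a_{23} + \lambda' r_{23} \\
\lambda a_{31} + \lambda' r_{31} & \lambda a_{32} + \lambda' r_{32} & \lambda a_{33} + \lambda' r_{33}
\end{vmatrix}
= \Delta\lambda^3 + \Theta\lambda^2\lambda' + \Theta'\lambda\lambda'^2 + \Delta'\lambda'^3. \qquad (6)$$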





Manifestly $\Delta$ and $\Delta'$ are the discriminants of $F$ and $F'$ respectively,
while the $n - 1$ intermediate coefficients $\Theta_i$ are simultaneous
invariants derived in succession by repeated application
to $\Delta$ of the Aronhold operator

$$\sum r_{ij}\frac{\partial}{\partial a_{ij}},$$

summed for the $\tfrac{1}{2}n(n + 1)$ effectively distinct coefficient suffixes
$ij$. It is also clear for geometrical reasons that the ratios
$\Delta : \Theta_1 : \ldots : \Delta'$ are absolute invariants, since the condition that
the quadric $\lambda F + \lambda' F'$ should degenerate is independent of
particular co-ordinate axes. The $n$ roots of the equation in
$\lambda : \lambda'$, obtained by equating (5) to zero, are in fact examples of
irrational invariants of $F$ and $F'$.

Probably the reader is familiar with these four invariants $\Delta$,
$\Theta$, $\Theta'$, $\Delta'$ of two conics,¹ as they provide interesting properties
of the usual analytical geometry. A relation involving them
expresses a geometrical fact about the conics, as, for example,
that $\Theta^2 = 4\Delta\Theta'$ if a triangle can be inscribed in the conic $F'$
which shall circumscribe conic $F$; or that $\Theta$ vanishes if a triangle
inscribed in $F'$ is self-conjugate to $F$.
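The coefficients of the binary form in $\lambda : \lambda'$ can be computed directly. The sketch below rests on the multilinearity of a determinant in its columns: the coefficient of $\lambda^{n-k}\lambda'^k$ is the sum of the determinants got from the first matrix by replacing $k$ of its columns by the corresponding columns of the second. The function names and the sample matrices are illustrative assumptions, not taken from the text.

```python
from fractions import Fraction
from itertools import combinations

def det(m):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in m]
    n, sign, result = len(m), 1, Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if m[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            m[c], m[pivot] = m[pivot], m[c]
            sign = -sign
        result *= m[c][c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            m[r] = [a - f * b for a, b in zip(m[r], m[c])]
    return sign * result

def pencil_invariants(A, R):
    """Coefficients [Delta, Theta_1, ..., Delta'] of det(lam*A + lam'*R)."""
    n = len(A)
    coeffs = []
    for k in range(n + 1):
        total = Fraction(0)
        for cols in combinations(range(n), k):
            # Columns in `cols` come from R, the rest from A.
            mixed = [[R[i][j] if j in cols else A[i][j] for j in range(n)]
                     for i in range(n)]
            total += det(mixed)
        coeffs.append(total)
    return coeffs

# Two conics already in canonical form: F = sum X_i^2, F' = sum A_i X_i^2.
identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
diagonal = [[2, 0, 0], [0, 3, 0], [0, 0, 5]]
canonical = pencil_invariants(identity, diagonal)

# Two conics in general position (illustrative symmetric matrices).
A = [[2, 1, 0], [1, 3, 1], [0, 1, 4]]
R = [[1, 0, 2], [0, 1, 0], [2, 0, 3]]
general = pencil_invariants(A, R)
```

For the canonical pair the four coefficients are the elementary symmetric functions of 2, 3, 5, in agreement with the characteristic equation.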

5. Reduction of Two Quadrics to the Form

$$F = X_1^2 + \ldots + X_n^2, \qquad F' = A_1 X_1^2 + A_2 X_2^2 + \ldots + A_n X_n^2.$$

Let the linear transformation be given by

$$X_i = \xi_i x_1 + \eta_i x_2 + \ldots \qquad (i = 1, 2, \ldots, n).$$

Then we take as our $\nu$ parameters $l_p$ (§9, p. 268),
the following $n^2 + n$ $(= \nu)$ quantities:

$$\xi_1, \eta_1, \ldots, \xi_2, \eta_2, \ldots, \xi_n, \ldots, A_1, A_2, \ldots, A_n,$$

in this order.

The number of parameters in each $X_i$ is $n$, so that $F$, $F'$ have

¹ First developed by Salmon. Cf. Salmon's Conic Sections, Sixth Edition,
Chap. XVIII. A good elementary account is to be found also in Sommerville's
Analytical Conics (G. Bell & Sons, 1924), pp. 266-294. It is strange that recent
books still omit to mention the crucial fact that these invariants form a
complete system. The present writer remembers the uneasy feeling he had as a
student when first reading this theory, and wondering why this set of four
invariants was tacitly assumed to tell the whole story.





$n(n + 1)$ parameters between them, if each is independent.
This fits the number of coefficients in two general quadrics.
Corresponding to the equations (12), p. 268, there will be $\nu$
conditions which equate functions $f_i$ of the parameters,
respectively to the $\nu$ coefficients

$$a_{11}, a_{12}, \ldots, a_{1n}, a_{22}, a_{23}, \ldots, a_{nn}, \qquad r_{11}, r_{12}, \ldots, r_{1n}, r_{22}, r_{23}, \ldots, r_{nn},$$

in this order, of the quadrics $\sum a_{ij}x_ix_j$, $\sum r_{ij}x_ix_j$. Also we
can solve the requisite $n(n + 1)$ equations for the parameters,
provided no functional relation $\phi(f) = 0$ exists (§9, p. 268). This
in turn is non-existent if a determinant $\left|\partial f_i / \partial l_j\right|$ of order $n(n + 1)$
does not vanish identically. By using the identical transformation,
$\xi_1 = \eta_2 = \ldots = 1$, $\xi_2 = \eta_1 = \ldots = 0$, this determinant is
seen to be a non-zero expression, $\pm\prod (A_i - A_j)$, $i \neq j$, and this
justifies the canonical form.

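More specifically, if when $n = 2$ the $2 \times 3$ parameters are
the usual $\xi_1, \eta_1, \xi_2, \eta_2$ together with $A_1, A_2$, then the
determinant $\left|\partial f / \partial l\right|$ becomes

$$\begin{vmatrix}
\xi_1 & \eta_1 & \cdot & A_1\xi_1 & A_1\eta_1 & \cdot \\
\cdot & \xi_1 & \eta_1 & \cdot & A_1\xi_1 & A_1\eta_1 \\
\xi_2 & \eta_2 & \cdot & A_2\xi_2 & A_2\eta_2 & \cdot \\
\cdot & \xi_2 & \eta_2 & \cdot & A_2\xi_2 & A_2\eta_2 \\
\cdot & \cdot & \cdot & \xi_1^2 & 2\xi_1\eta_1 & \eta_1^2 \\
\cdot & \cdot & \cdot & \xi_2^2 & 2\xi_2\eta_2 & \eta_2^2
\end{vmatrix}.$$

If $\xi_1 = \eta_2 = 1$, $\xi_2 = \eta_1 = 0$, there is only one non-zero element
in each of col$_1$, col$_3$, row$_5$, row$_6$. After expanding by cols$_{13}$, rows$_{56}$,
then $\begin{vmatrix} \xi_1 & A_1\xi_1 \\ \eta_2 & A_2\eta_2 \end{vmatrix}$ is left, giving $(A_2 - A_1)$ alone. This method
is quite general. We delete the $n$ last rows and those $n$ of the
$2\binom{n+1}{2}$ columns which intersect the rows at $\xi_1^2, \eta_2^2, \ldots$;
then $n$ of the first columns and their analogous rows;
then subtract row$_i$ from row$_j$ for $\binom{n}{2}$ pairs of suitable suffixes,
getting a single unit matrix in the first $\binom{n}{2}$ columns.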
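The non-vanishing of the functional determinant at the identical transformation can be checked by direct calculation. This is a sketch under stated assumptions: the coefficient of $x_m$ in $X_k$ is stored as `c[k][m]`, the functions are taken as $a_{ij} = \sum_k c_{ki}c_{kj}$ and $r_{ij} = \sum_k A_k c_{ki}c_{kj}$ for $i \le j$, and the ordering of rows and columns is chosen for convenience. With this normalisation the determinant carries a numerical factor $2^n$, which may differ from the normalisation of the text; only the factor $\prod (A_j - A_i)$ matters for the argument.

```python
from fractions import Fraction
from itertools import combinations, combinations_with_replacement

def det(m):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in m]
    n, sign, result = len(m), 1, Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if m[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            m[c], m[pivot] = m[pivot], m[c]
            sign = -sign
        result *= m[c][c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            m[r] = [a - f * b for a, b in zip(m[r], m[c])]
    return sign * result

def jacobian(c, A):
    """Rows: a_ij then r_ij (i <= j). Columns: all c[k][m], then A_1..A_n."""
    n = len(A)
    pairs = list(combinations_with_replacement(range(n), 2))
    rows = []
    for weighted in (False, True):          # False: a_ij, True: r_ij
        for i, j in pairs:
            row = []
            for k in range(n):
                w = A[k] if weighted else 1
                for m in range(n):
                    # d/dc_km of c_ki * c_kj
                    row.append(w * ((m == i) * c[k][j] + (m == j) * c[k][i]))
            for k in range(n):
                # d/dA_k of r_ij; a_ij does not involve A_k
                row.append(c[k][i] * c[k][j] if weighted else 0)
            rows.append(row)
    return rows

def value_at_identity(A):
    n = len(A)
    identity = [[int(i == j) for j in range(n)] for i in range(n)]
    return det(jacobian(identity, A))

def expected(A):
    prod = 2 ** len(A)
    for i, j in combinations(range(len(A)), 2):
        prod *= A[j] - A[i]
    return prod
```

When two of the canonical coefficients coincide the determinant vanishes, which is the specialized case that the canonical form does not cover.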





For the difficult case of specialized quadrics, when this determinant
vanishes and this canonical form is not justified, the
reader should consult a work on Invariant Factors.¹


EXAMPLES 

1. Two general quadrics have a common self-conjugate simplex.

2. The canonical coefficients $A_i$ are the roots of the characteristic
equation $|\lambda a_{ij} - r_{ij}| = 0$.

For the equation is invariantive; hence in the canonical form it is
$|\lambda - A_i| = (\lambda - A_1)(\lambda - A_2) \ldots (\lambda - A_n) = 0$.

3. The symmetric functions $1, \sum A_i, \sum A_iA_j, \ldots, A_1A_2 \ldots A_n$ are the
$n + 1$ irreducible invariants.

4. Two quadrics have at least $n$ quadric contravariants.

The tangential equation of $\lambda F + F' = 0$ in canonical form is

$$\frac{U_1^2}{\lambda + A_1} + \ldots + \frac{U_n^2}{\lambda + A_n} = 0,$$

leading to a binary $(n - 1)$-ic for $\lambda$. The $n$
coefficients are contravariants.

5. The Jacobian of these contravariants is a contravariant of order $n$,
which has $n$ linear factors if the $n$ coefficients $A_i$ are distinct.

[Prove it for $n = 2, 3, 4$ and then generalize.

6. This Jacobian denotes the common self-conjugate simplex.

7. Reciprocate results 4, 5 and 6.

6. Complete System of $(n + 1)$ Invariants.

The $n + 1$ forms $\Delta, \Theta_1, \ldots, \Delta'$ are a complete irreducible
system.

Proof , — 

In fact, let $I$ be a polynomial invariant of the two quadrics.
Then by the fundamental theorem it can be expressed as $\sum P$
where $P$ is a product of $w$ bracket factors of the type

$$(A_i R_{n-i}) = (a_1 a_2 \ldots a_i\, r_1 r_2 \ldots r_{n-i}), \qquad (7)$$

$i = 0, 1, 2, \ldots, n$. Here there are $i$ equivalent symbols convolved
in a matrix $A_i$ of currency $i$, referring to the first quadric $F$,
and $n - i$ symbols in the matrix $R_{n-i}$ for the second quadric $F'$.
Let the factors of $P$ be arranged from left to right as far as

¹ Cf. Bromwich, Quadratic Forms (Cambridge Tract, 1906); Jessop, Line
Complex (Cambridge, 1913); Dickson, Modern Algebraic Theories (Chicago,
1926), 133; Bôcher, Higher Algebra (New York, 1919), Chap. XX.




possible in descending order of currency i. As in §8, p. 193, we 
can convolve the duplicate symbols of the first factor in the 
second factor and then if necessary rearrange factors in descend- 
ing order. Finally, we express a typical product as 

(8) 

where n>q^>q 2 > >qp>Q, 

and all the symbols of the first quadric are accounted for among 
the A* a. This product P is now said to be prepared for the first 
quadric. The symbols not yet expressed refer entirely to the 
second quadric. If now q^ = n, P contains the invariant 
= (U 1 U 2 • • * factor, and is reducible. 

Similarly if P contains a factor • • • ^/j) composed entirely 
of symbols of P', it is reducible by convolving the duplicate 
symbols in a second factor, so that the discriminant 
emerges. Thisr only happens \iw> 2v, or if q^ = 0. 

Accordingly, we suppose w = 2v, q^ > 0, so that the final 
factor must contain symbols of both quadrics. We next consider 
the symbols of the second quadric. Allowing for duplicates in 
the two final factors we write P more fully as 

+ *--+ S- = 

where B, C, D refer to the second quadric, and all the 2s„ symbols 
in C and D entirely differ. If 0, P contains the factor 0^^ 
and is reducible. So we take Sp > 0. 

Now let the possible forms $P$ be examined in the following
order:

(i) By ascending weight $w$.

(ii) When the weight is the same, in ascending degree in the
coefficients of the first quadric, and therefore in ascending value
of $\sum q_i$.

(iii) When $w = w'$, $\sum q_i = \sum q_i'$ for two forms $P$ and $P'$, we
examine $P$ before $P'$ if $q_1 = q_1', \ldots, q_i = q_i'$, $q_{i+1} > q'_{i+1}$. The
value of $i$ is taken in ascending order.




Further than this the order is immaterial. The effect of such
an order is to render any process a reducing process, which shifts
a symbol $a$ towards the left out of its own factor. For the resulting
form (or forms) can then be prepared as in (8), when it will be
among those already examined.

Now since s„ > 0, we can as before convolve the duplicates 
of the Tcy + Sp symbols D in the last factor but one. Reference 
to the fundamental identity, §11, p. 48, shows that this process 
either shifts entirely, leaving 

in place of the two final factors, or else shifts some symbols a 
of to the left. The latter case can only give rise to forms 
already examined.

This proves that every product $P$ is reducible, with the possible
exception of $\Delta, \Theta_1, \ldots, \Delta'$. In other words, every polynomial
invariant of two quadrics is expressible as a polynomial in these
$n + 1$ invariants.

Finally these are irreducible, because a relation expressing
any one $\Theta_i$, say, in terms of the remainder is structurally
impossible, as is at once seen by examining the degree in both
sets of coefficients on the left and right of an assumed identity

$$\Theta_i = \sum \lambda\,\Delta^{\alpha}\,\Theta_1^{\beta} \ldots \Delta'^{\omega}.$$

This proves the theorem.


EXAMPLES 

1. Prove that $(abrs)(abct)(crst)$ vanishes identically.

2. Prove $(bcr)(cas)(abt)(rst) = \tfrac{1}{6}(abc)^2(rst)^2$.
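Both examples can be tested numerically before a proof is attempted. The sketch below assumes the usual reading of the symbolism: $a$, $b$, $c$ are equivalent symbols of the first quadric, so that a product $a_ia_j$ stands for the coefficient $a_{ij}$, and $r$, $s$, $t$ likewise for the second quadric. Each bracket is expanded as a signed sum over permutations. The sample matrices and function names are illustrative only, and a numerical check of this kind is not a proof.

```python
from itertools import permutations

def sign(p):
    """Sign of a permutation of 0..n-1, found by sorting it with swaps."""
    s, p = 1, list(p)
    for i in range(len(p)):
        while p[i] != i:
            j = p[i]
            p[i], p[j] = p[j], p[i]
            s = -s
    return s

def det(m):
    """Determinant from the permutation expansion (small integer matrices)."""
    total = 0
    for p in permutations(range(len(m))):
        term = sign(p)
        for i in range(len(m)):
            term *= m[i][p[i]]
        total += term
    return total

def example1(a, r):
    """(abrs)(abct)(crst) for quaternary quadrics with matrices a and r."""
    perms = [(p, sign(p)) for p in permutations(range(4))]
    total = 0
    for (i, j, k, l), s1 in perms:              # a_i b_j r_k s_l
        for (i2, j2, m, n), s2 in perms:        # a_i2 b_j2 c_m t_n
            for (m2, k2, l2, n2), s3 in perms:  # c_m2 r_k2 s_l2 t_n2
                total += (s1 * s2 * s3
                          * a[i][i2] * a[j][j2] * a[m][m2]
                          * r[k][k2] * r[l][l2] * r[n][n2])
    return total

def example2(a, r):
    """(bcr)(cas)(abt)(rst) for ternary quadrics with matrices a and r."""
    perms = [(p, sign(p)) for p in permutations(range(3))]
    total = 0
    for (i, j, k), s1 in perms:                 # b_i c_j r_k
        for (l, m, n), s2 in perms:             # c_l a_m s_n
            for (p, q, u), s3 in perms:         # a_p b_q t_u
                for (x, y, z), s4 in perms:     # r_x s_y t_z
                    total += (s1 * s2 * s3 * s4
                              * a[i][q] * a[j][l] * a[m][p]
                              * r[k][x] * r[n][y] * r[u][z])
    return total

a3 = [[2, 1, 0], [1, 3, 1], [0, 1, 4]]
r3 = [[1, 0, 2], [0, 1, 0], [2, 0, 3]]
a4 = [[2, 1, 0, 0], [1, 3, 1, 0], [0, 1, 4, 1], [0, 0, 1, 5]]
r4 = [[1, 0, 2, 0], [0, 1, 0, 3], [2, 0, 3, 0], [0, 3, 0, 2]]
```

Since $(abc)^2 = 3!\,|a_{ij}|$ for a ternary quadric, the right side of Example 2 is six times the product of the two discriminants, which is what the comparison uses.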

7. Complete Systems involving Variables. 

The complete system for two quadrics and all possible variables
$x, \pi_2, \ldots, u\ (= \pi_{n-1})$ has not been discovered except
when $n = 2$, 3, or 4. But it can be demonstrated that the
number of covariants involving $x$ alone is $n + 1$.

It can be shown that $n$ of the $n + 1$ covariants are the $n$
coefficients of $\lambda_1, \lambda_2$ in the binary $(n - 1)$-ic obtained by forming the
dual point quadratic from the tangential form $\lambda_1 S + \lambda_2 S'$, where





$S$ is the bordered determinant $\begin{vmatrix} a_{ij} & u_i \\ u_j & 0 \end{vmatrix}$ (§3, p. 101). These
give the quadratic covariants $f$, $f'$ with $n - 2$ intermediate
quadratics, as is well known in the ternary case. The remaining
covariant is their Jacobian, which represents the common
self-conjugate simplex of two quadratics.

Also,¹ if $n > 2$, a complete system involving any number of
cogredient variables $x, y, z, \ldots$ has, besides $n + 1$ invariants
and $n + 1$ covariants, the $n - 1$ functional determinants

(^i I (^n—i | ^n~i) 1=1,2, , , , y 71 1. 


A system including one $x$ and any number of contragredients
$u, v, \ldots$ has also been found.²

By making $n = 2$ this system becomes the binary system
for two quadratics already discussed. In this case the
functional determinant coincides with the simultaneous
covariant, making, in all, six irreducible forms.

As in the binary case a syzygy connects the square of the
Jacobian covariant with the remaining $2n + 1$ invariants and
covariants.³


Example . — 

For the ternary case such a completely irreducible system is 
F=:ax\ = rx^, (apa:)*, aj, rj, Tp^, apUxipxy), rafxioLxy), 
where a, p are each of currency two in symbols of their respective quadrics. 

As for the case where only variables $u, v, w, \ldots$ occur, manifestly
a concomitant is expressible symbolically by outer products
of type

$$(A_i R_j U_k) = (a_1 a_2 \ldots a_i\, r_1 \ldots r_j\, u_1 \ldots u_k),$$

where $i + j + k = n$ and the three matrices of symbols refer

¹ Turnbull and Williamson, Proc. Royal Soc. Edinburgh, 45 (1926), 149-166.

² Transactions Cambridge Phil. Soc., 21 (1909), 197-240.

For n = 3, the ternary case, the system of two quadratics consists of 20
forms: Gordan, Math. Annalen, 19 (1882), 629. See also Grace and Young,
Algebra of Invariants (1904), pp. 280-287. This has been proved to be strictly
irreducible: Van der Waerden, Amsterdam Ak. Versl., 32 (1923), 138-147. For
three ternary quadratics Ciamberlini found a system of 128 forms, Giorn. di
mat. (Battaglini), 24 (1886), 141. Of these six are reducible. For four or
more ternary quadratics see Proc. London Math. Soc., (2), 9 (1910), 81-121.

For n = 4, the quaternary case of two quadrics, cf. Gordan, Math. Annalen,
56 (1903), 1-48; Turnbull, Proc. London Math. Soc., (2), 18 (1919), 69-94. This
system has 123 concomitants.

³ Gilham, Proc. London Math. Soc., (2), 20 (1921), p. 326.





to the quadrics F, F' and the variables respectively. By the 
preceding methods it can be shown that all possible irreducible 
forms are included among the following and their polars: 

Pij = i, j,k = 0,1, i+j+k = n. 

(A,U) (A,^Itj,U') (JtjA^U ") . . . {Rj^m''^) 

w tj ^ 1*2 ^ . . . ]> i„, J 2 • "^jv 


Other references to the literature will be found in the Encyklopädie der
Mathematischen Wissenschaften, III, 3, 6 (1922), and the earlier Berichte by
W. F. Meyer. More recently with reference to the cases n = 4, n = 6, cf. Proc.
London Math. Soc. (2), 25 (1926), 303-327, and Proc. Roy. Soc. Edinburgh, 46
(1926), 210-222 and 48 (1928), 70-91.



CHAPTER XXI 

Miscellaneous Recent Developments 

1. Restricted Transformations. 

Hitherto we have dealt with the projective invariant theory.
It is possible to extend the same methods, recently developed by
Weitzenböck,¹ to special cases in which the transformations are
restricted within a subgroup of the general group (§7, p. 161). An
invariant of a subgroup is a function which satisfies the invariant
definition for all transformations within the subgroup: and the
more restricted the group the greater will be the number of
possible invariants, because they are required to satisfy fewer
conditions.

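Consider the non-singular coefficient matrices, where
$m = n - 1$,

$$M = \begin{bmatrix} e_{11} & \ldots & e_{1n} \\ \vdots & & \vdots \\ e_{n1} & \ldots & e_{nn} \end{bmatrix}, \quad
M_1 = \begin{bmatrix} e_{11} & \ldots & e_{1m} & e_{1n} \\ \vdots & & \vdots & \vdots \\ e_{m1} & \ldots & e_{mm} & e_{mn} \\ 0 & \ldots & 0 & e_{nn} \end{bmatrix}, \quad
M_0 = \begin{bmatrix} e_{11} & \ldots & e_{1m} & 0 \\ \vdots & & \vdots & \vdots \\ e_{m1} & \ldots & e_{mm} & 0 \\ 0 & \ldots & 0 & e_{nn} \end{bmatrix},$$

$$D = \begin{bmatrix} e_{11} & 0 & \ldots & 0 \\ 0 & e_{22} & \ldots & 0 \\ \vdots & & & \vdots \\ 0 & \ldots & 0 & e_{nn} \end{bmatrix}, \quad \rho I, \quad I.$$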

These are in order, as regards degree of restriction, and each
generates a group. For the identical transformation $x = Ix'$,
every function is an invariant: for the scalar transformation
$x = \rho Ix'$, every homogeneous function is an invariant: for the


¹ Cf. Invariantentheorie, Chap. IX-XII.




diagonal transformation $x = Dx'$, every gradient (isobaric)
function is an invariant: for the general case $x = Mx'$ we have
the preceding invariant theory. What can be said of the
intermediate orthogonal and affine cases?

The affine subgroup is given by a matrix $M_1$. Here we deduce

$$x_n = e_{nn} x_n', \qquad (2)$$

while the other variables $x_i$ have general linear transformations.
Since $|M_1| = e_{nn}\,|e_{11} \ldots e_{mm}| \neq 0$, we can suppose $x_n$, $x_n'$ to
be constants and $x_1, \ldots, x_m$ Cartesian co-ordinates in $m$-fold
space. If $m = 2$ this collineation $x \to x'$ is seen to leave the
line at infinity latent; if $m = 3$, the plane at infinity latent.
Parallel lines remain parallel after transformation; and these
facts are true for all values of $m\ (= n - 1)$.

The Fundamental Theorem of symbolic methods for affine
transformations now runs as follows:

Every polynomial invariant $K$ of affine transformations for $p$
given ground forms $f_1, f_2, \ldots, f_p$ is identical with a projective
invariant of the same ground forms together with a certain linear
form $L = l_x$ latent in the transformations.

For example, if $y$, $z$, $t$ are three coplanar points, then $(yzt)$, $l_y$, $l_z$, $l_t$ are
projective invariants of a line $l$ and the points. The invariant

$$I = \frac{(yzt)}{l_y l_z l_t}$$

is absolute for affine transformations. Also, if we take $l_x = x_n = x_3$ as in
the preceding work, and we write $y_3 = z_3 = t_3 = 1$ with $(y_1, y_2)$, $(z_1, z_2)$,
$(t_1, t_2)$ as Cartesian co-ordinates, then $\tfrac{1}{2}I$ becomes the area of the triangle
$yzt$.
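A short numerical illustration of the area statement follows. It is a sketch assuming the latent line $l = (0, 0, 1)$, so that $l_y = y_3$, and the function names are hypothetical. The second set of points repeats the first with different homogeneous multipliers, to show that $I$ does not depend on the choice of multiplier.

```python
from fractions import Fraction

def bracket(y, z, t):
    """Outer product (yzt): the determinant of three homogeneous points."""
    return (y[0] * (z[1] * t[2] - z[2] * t[1])
            - y[1] * (z[0] * t[2] - z[2] * t[0])
            + y[2] * (z[0] * t[1] - z[1] * t[0]))

def affine_invariant(y, z, t):
    """I = (yzt) / (l_y l_z l_t) with the latent line l = (0, 0, 1)."""
    return Fraction(bracket(y, z, t), y[2] * z[2] * t[2])

# Right-angled triangle with legs 4 and 3, third co-ordinates equal to 1.
y, z, t = (0, 0, 1), (4, 0, 1), (0, 3, 1)
area = affine_invariant(y, z, t) / 2

# The same three points with other homogeneous multipliers.
same = affine_invariant((0, 0, 2), (8, 0, 2), (0, 9, 3))
```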

Again the affine theory of a conic rests on the invariants of a quadratic
$f = a_x^2 = b_x^2$ and a linear form $l_x$.

Thus $C = (abl)^2$ vanishes if for these Cartesian co-ordinates the conic
is a parabola. Non-symbolically

$$C = \begin{vmatrix} a_1 & b_1 & 0 \\ a_2 & b_2 & 0 \\ a_3 & b_3 & 1 \end{vmatrix}^2 = (a_1b_2 - a_2b_1)^2 = 2(a_{11}a_{22} - a_{12}^2).$$

2. Preparatory Reductions leading to the Proof of the Fundamental
Theorem.

First we consider the latent linear form $x_n$, and write it
symbolically as

$$L = (lx) = l_x = l_1x_1 + l_2x_2 + \ldots + l_nx_n, \qquad (3)$$

where

$$l_1 = l_2 = \ldots = l_{n-1} = 0, \quad l_n = 1. \qquad (4)$$

This artifice brings the particular form $L$ into line with the
general type of linear form. It follows that

$$\alpha_l = \alpha_n, \quad \ldots, \qquad (ab \ldots dl) = \begin{vmatrix} a_1 & b_1 & \ldots & d_1 & 0 \\ a_2 & b_2 & \ldots & d_2 & 0 \\ \vdots & & & \vdots & \vdots \\ a_n & b_n & \ldots & d_n & 1 \end{vmatrix} = (ab \ldots d)_n, \qquad (5)$$

where this last suffix $n$ denotes that the determinant has $n - 1$
rows numbered $1, 2, \ldots, n - 1$.

Next, we state the enunciation of the theorem in terms of
possible symbolic types: namely, every polynomial invariant
$K$ of the affine group can be symbolically expressed by
the factors

$$(ab \ldots dl) = (ab \ldots d)_n, \qquad \alpha_l = \alpha_n. \qquad (6)$$

Here $a, b, \ldots, \alpha, \beta, \ldots$ denote variables or symbols of the
ground forms, while $l$ denotes the set $(0, 0, \ldots, 0, 1)$.

Thirdly, if the proof holds for linear forms $a_x, b_x, \ldots$,
it will hold as before for the general ground form. In fact
the symbolic methods hitherto used, together with polarization
and the Aronhold process, still continue to be valid in rendering
all ground forms multilinear, as these processes have nothing to
do with the coefficients $e_{ij}$ of the matrix $M_1$, which alone has
been modified by the affine conditions.

Fourthly, by expressing each linear form as an $(n - 1)$th
compound $(ab \ldots d \mid xy \ldots z) = \sum \pm a_x b_y \ldots d_z$ we reduce the
problem to that of ground forms

$$a_x b_y \ldots d_z \qquad (7)$$

all of one type, whose symbolic invariant types will now be
$(ab \ldots m)$, $(ab \ldots dl)$ only, in place of (6). Then invariants
may contain groups of $(n - 1)$ symbols owing to implicit
convolution of each symbol $\alpha$. If these symbols $\alpha$ are finally
restored they will merely add the other types $(\alpha\beta \ldots \mu)$, $a_\alpha$, $\alpha_n$
to the list (cf. §9, p. 207).





3. Characteristic Invariant Property. 

Since $a_x = a'_{x'}$ as before, the transformation $a' \to a$ is given
by $a' = M_1'a$, involving the transposed matrix $M_1'$. Hence

$$\begin{aligned} a_i' &= e_{1i}a_1 + \ldots + e_{mi}a_m, \qquad i = 1, 2, \ldots, n - 1, \\ a_n' &= e_{1n}a_1 + \ldots + e_{mn}a_m + e_{nn}a_n \qquad (m = n - 1). \end{aligned} \qquad (8)$$

Now if $K = K(a, b, \ldots)$ is a polynomial affine invariant of
linear forms (7), then the identity

$$K' = K(a', b', \ldots) = \phi(e_{ij})\,K(a, b, \ldots) \qquad (9)$$

holds for all values of $a_i, b_i, \ldots, e_{ij}, \ldots$, when $a', b', \ldots$ are
given by (8). The proof of §2, p. 169 will now apply to
show that $\phi(e_{ij})$ can only be a polynomial factor of a power
of $|M_1|$. But $|M_1|$ has polynomial factors of two kinds only,
$e_{nn}$ and $\Delta_{n-1} = |e_{11} \ldots e_{mm}|$, where $\Delta_{n-1}$ is a general determinant
in $m^2$ arguments and therefore irresoluble. We infer

$$K' = \Delta_{n-1}^{\,r}\,(e_{nn})^s\,K. \qquad (10)$$


4. Proof of the First Fundamental Theorem. 

This last identity can be written

$$K(a', b', \ldots) = \Delta_{n-1}^{\,r}\,(e_{nn})^s\,K(a, b, \ldots). \qquad (11)$$

We have two cases to consider.

Case (1) $K(a, b, \ldots)$ contains no symbol $a_n, b_n, \ldots$ at all
with suffix $n$.

Case (2) $K(a, b, \ldots)$ contains some symbols with suffix $n$.

Case (1). $K$ is now a projective invariant of linear forms such
as

$$a_1x_1 + a_2x_2 + \ldots + a_{n-1}x_{n-1}$$

in the field of order $n - 1$. For the transformation (8) $a \to a'$
in this case contains no $a_n', b_n', \ldots$, so that $K'$ is free from the
argument $e_{nn}$. Hence in (11), $s = 0$ and $K' = \Delta_{n-1}^{\,r} K$. Thus $K$
can be symbolized entirely by means of factors of type

$$(ab \ldots d)_n = (ab \ldots dl).$$




Case (2). Here $K(a_i', \ldots)$ involves $a_n', b_n', \ldots$. By
means of the particular affine transformation

$$x_i = x_i' \quad (i = 1, 2, \ldots, n - 1), \qquad x_n = e_{nn}x_n' \qquad (12)$$

we gather that (11) is satisfied only if $K$ is homogeneous in the
quantities $a_n, b_n, \ldots$. Let us therefore write

$$K = K_1g_1 + K_2g_2 + \ldots + K_hg_h \quad (h \geq 1), \qquad (13)$$

where each $K_i$ is free from $a_n, b_n, \ldots$; and each $g_i$ is a form of
order $s$ in the set $a_n, b_n, \ldots$. Also let the right side of (13) be
brought to its lowest terms as a function of $a_n, \ldots$, so that the
number $h$ cannot be diminished any further.

Hence, by (11),

$$K_1'g_1' + \ldots + K_h'g_h' = \Delta_{n-1}^{\,r}\,(e_{nn})^s\,(K_1g_1 + \ldots + K_hg_h). \qquad (14)$$

If we substitute for each $a_n', b_n', \ldots$ in $g_i'$ on the left by
means of

$$a_n' = e_{1n}a_1 + e_{2n}a_2 + \ldots + e_{nn}a_n, \quad \ldots, \qquad (15)$$

then each $g_i'$ is a polynomial of order $s$ in $e_{nn}$. Equating the
coefficient of $e_{nn}^{\,s}$ on both sides, we obtain the identity

$$K_1'g_1 + \ldots + K_h'g_h = \Delta_{n-1}^{\,r}\,(K_1g_1 + \ldots + K_hg_h);$$

whence

$$K_i' = \Delta_{n-1}^{\,r}\,K_i. \qquad (16)$$

As in case (1), each $K_i$ is now a projective invariant of the field
of order $n - 1$, and thus can be expressed entirely by means
of factors $(ab \ldots d)_n$.

Now let $\nu$ be the number of symbols $a, b, \ldots$ in $K$, which is
homogeneous in the $n$ elements $a_1, a_2, \ldots, a_n$ of each such
set $a$. Then either $\nu \geq n$ or $\nu < n$. If the former, we choose
$n$ symbols $a, b, \ldots, m$ and develop $K$ as a Gordan-Capelli series
((22), p. 254)

$$K = K_0 + (ab \ldots m)K_1 + \ldots + (ab \ldots m)^{\lambda}K_{\lambda} \quad (\lambda \geq 0). \qquad (17)$$

Again these forms $K_i$ will be affine invariants, satisfying (11),
each of which can be dealt with in the same way, if it still contain
$n$ symbols. Finally we are left to consider the case of
at most $n - 1$ symbols in $K$, so that $\nu < n - 1$ or $\nu = n - 1$.

If $\nu < n - 1$, no factor $(ab \ldots d)_n$ is possible, although by
(16) $K_i$ is expressible by such factors. Hence each $K_i$ can only
be a constant $c_i$, and consequently $r = 0$; so that

$$K = c_1g_1 + c_2g_2 + \ldots + c_hg_h = g_s, \qquad (18)$$

where $g_s$ is a form of order $s$ in $a_n, b_n, \ldots$ alone.

Also, if $\nu = n - 1$, then

$$K = (ab \ldots d)_n^{\,r}\,(c_1g_1 + \ldots + c_hg_h) = (ab \ldots d)_n^{\,r}\,g_s,$$

and we can discard the factor $(ab \ldots d)_n^{\,r}$ which is of the desired
type, and confine ourselves entirely to $g_s$.

Finally, it can be proved that $g_s$ vanishes identically. For
by (11), we have the identity

$$g_s' = (e_{nn})^s\,g_s. \qquad (19)$$

But if we take the particular $n - 1$ sets of values

$$\begin{matrix} a = & 1 & 0 & 0 & \ldots & 0 & 0 \\ b = & 0 & 1 & 0 & \ldots & 0 & 0 \\ & & & \ldots & & & \\ d = & 0 & 0 & 0 & \ldots & 1 & 0, \end{matrix}$$

then $a_n = b_n = \ldots = 0$, so that $g_s = 0$, while, by (15),

$$g_s' = g_s(e_{1n}, e_{2n}, \ldots, e_{mn}). \qquad (20)$$

But (19) now shows that $g_s'$ vanishes, so that $g_s(e_{1n}, \ldots, e_{mn})$
vanishes, although its arguments are arbitrary. Hence it
vanishes identically. This completes the proof of the First
Fundamental Theorem for affine invariants.

5. Consequences of the Theorem. 

Since the Fundamental Theorem links affine invariants with
projective invariants, by means of the additional linear form
$L = l_x$, it follows at once that all the main theorems apply to this
restricted case: the Second Fundamental Theorem, and the
theorems of Gordan, Hilbert, Clebsch, and Peano. Further, we
can imagine, in the preceding proof, that an arbitrary general
linear transformation $T_0$ has been applied to the original
variables, expressing them in terms of a set $\xi_1, \xi_2, \ldots, \xi_n$, and in
particular that

$$x_n = q_1\xi_1 + q_2\xi_2 + \ldots + q_n\xi_n.$$

Applied to the affine symbolic forms this replaces $(ab \ldots d)_n$
by a type $(ab \ldots dq)$ and $\alpha_n$ by $\alpha_q$, which are no other than the
types $(ab \ldots dl)$ and $\alpha_l$ already utilized. Hence any given
linear form may be taken as the latent form of the transformation.

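Of course, if the $x_n$ co-ordinate is selected as latent, we must
note that certain fundamental identities, not of projective type,
will arise (cf. Ex. 4, p. 51), such as

$$(\alpha\beta\gamma)x_3 = (x\beta\gamma)\alpha_3 + (\alpha x\gamma)\beta_3 + (\alpha\beta x)\gamma_3$$

instead of the usual

$$(\alpha\beta\gamma)l_x = (x\beta\gamma)l_\alpha + (\alpha x\gamma)l_\beta + (\alpha\beta x)l_\gamma.$$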
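The two identities are one and the same when $l = (0, 0, 1)$, as a numerical trial shows. The sketch below is illustrative only: the four ternary sets and the second linear form are arbitrary sample values, and the helper names are hypothetical.

```python
def bracket(u, v, w):
    """Outer product (uvw) of three ternary sets."""
    return (u[0] * (v[1] * w[2] - v[2] * w[1])
            - u[1] * (v[0] * w[2] - v[2] * w[0])
            + u[2] * (v[0] * w[1] - v[1] * w[0]))

def inner(l, x):
    """Value l_x of the linear form l at the set x."""
    return sum(li * xi for li, xi in zip(l, x))

def both_sides(alpha, beta, gamma, x, l):
    """Return the left and right sides of the fundamental identity."""
    left = bracket(alpha, beta, gamma) * inner(l, x)
    right = (bracket(x, beta, gamma) * inner(l, alpha)
             + bracket(alpha, x, gamma) * inner(l, beta)
             + bracket(alpha, beta, x) * inner(l, gamma))
    return left, right

alpha, beta, gamma, x = (1, 2, 3), (0, 1, 4), (5, 6, 0), (2, -1, 7)
latent = both_sides(alpha, beta, gamma, x, (0, 0, 1))    # l_x = x_3
general = both_sides(alpha, beta, gamma, x, (3, -2, 5))  # an arbitrary l
```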

Examples . — 

1. Examine the restricted transformation $x = M_1'x'$ where $M_1'$ is the
transposed matrix of $M_1$.

It leaves a point $\lambda$ latent. By reciprocating the above work, its
concomitants are symbolized by types

(S ; " J)l’ = (“p • • • 

(Weitzenböck.)

2. The affine group with a fixed point is given by the matrix $M_0$.
Prove that a point $\lambda$ and a prime $l_x$ are both latent; and that the
requisite symbolic invariants of this group are

$$(ab \ldots m), \quad (ab \ldots dl), \quad a_\lambda,$$
$$(\alpha\beta \ldots \mu), \quad (\alpha\beta \ldots \delta\lambda), \quad \alpha_l,$$

together with the absolute invariant $l_\lambda$ which is purely numerical.

(Weitzenböck.)

6. The Orthogonal Group. 

A similar theorem holds when a quadric is latent: the projective
theory of $p$ ground forms $f_1, \ldots, f_p$ together with the latent
quadric is in close touch with the orthogonal
invariant theory of the $p$ ground forms alone.

This theorem lies at the base of an algebraic account of 
Euclidean, elliptic, or hyperbolic geometry. For instance, in 
Euclidean geometry, for ternary forms is taken to be 
other types it is a general ternary quadratic. 
The theorem also covers Riemannian geometry where the
element of arc is given by $ds^2 = \sum g_{ik}\,dx_i\,dx_k$, as far as
metrical properties of small intervals are concerned.

If a, 6, a, P are ternary symbols, then invariants of are 
composed of types (a6c), (ajSy), aa, &c. For the Euclidean 
case, if _jL then + ctgjSg = (aj j8), which 

accounts for the importance of the inner product of two vectors 
a and j3, when co-ordinate axes are rectangular. 

Again the tangential equation Up^ = 0 for the quadratic 
Xi^ + leads to a simultaneous co variant (apx)^ of two quad- 
ratics. By the Clebsch transference principle this vanishes if 
the pairs of tangents to the two conics Up^ = 0, = 0, from 

the point x, form a harmonic pencil. Then if Up^ = 0 gives the 
circular points at infinity, the tangents to the other conic must 
be at right angles. 

Non-symbolically (apx)^ = 0 gives the equation of the director 
circle. 

This theorem also throws light on elementary analytical solid
geometry, where such formulae appear as $\cos\theta = ll' + mm' + nn'$
for the angle between two straight lines whose direction cosines
are given. For orthogonal transformation this is an invariant;
in fact it is an inner product of two unit vectors. Likewise the
volume of the tetrahedron, three of whose edges are unit vectors,
is $\tfrac{1}{6}(l\,l'\,l'')$, in terms of an outer product.

It is a commonplace that inner and outer products should 
so arise, but the invariant theory shows that such products 
give a complete mechanism for dealing with the geometrical 
entities. 
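The invariance of the inner and outer products under a proper orthogonal transformation can be seen in a small example. This is a sketch under stated assumptions: the matrix `M` below is one particular rational orthogonal matrix of determinant $+1$, and the three unit vectors are sample values chosen for illustration.

```python
from fractions import Fraction

def inner(a, b):
    """Inner product (a | b)."""
    return sum(x * y for x, y in zip(a, b))

def outer(a, b, c):
    """Outer product (abc) of three vectors in three dimensions."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))

def transform(M, v):
    """Apply the matrix M to the vector v."""
    return [inner(row, v) for row in M]

third = Fraction(1, 3)
# Rows are mutually orthogonal unit vectors and the determinant is +1.
M = [[1 * third, -2 * third, 2 * third],
     [2 * third, -1 * third, -2 * third],
     [2 * third, 2 * third, 1 * third]]

l1 = [Fraction(1), Fraction(0), Fraction(0)]
l2 = [Fraction(3, 5), Fraction(4, 5), Fraction(0)]
l3 = [Fraction(0), Fraction(0), Fraction(1)]

cos_theta = inner(l1, l2)                       # angle between l1 and l2
volume = Fraction(1, 6) * outer(l1, l2, l3)     # tetrahedron on l1, l2, l3

m1, m2, m3 = (transform(M, v) for v in (l1, l2, l3))
```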

7. Fundamental Theorem of Orthogonal Transformation. 

First let us consider this theorem when the latent quadric
can be written as

$$(x \mid x) = x_1^2 + x_2^2 + \ldots + x_n^2$$

so that the transformation is orthogonal (§3, p. 162). Further
let us confine the discussion to the proper orthogonal case,
by which is meant the case when the determinant $\Delta$ of the
transformation is unity. If $\Delta = -1$ the transformation is
called improperly orthogonal. The proof needs two preliminary
lemmas.




Lemma I. A proper orthogonal transformation exists which
transforms a given unit vector $p$ into another such vector $q$.

Consider the transformation

$$x_i' = \frac{2(x \mid p + q)}{(p + q \mid p + q)}\,(p_i + q_i) - x_i \qquad (i = 1, 2, \ldots, n),$$

where $(p + q \mid p + q) = \sum_j (p_j + q_j)^2 \neq 0$. Here $\sum x_i'^2$ can easily
be calculated in terms of $p_i$, $q_i$, leading to the result

$$(x' \mid x') = (x \mid x).$$

Hence the transformation $x \to x'$ is orthogonal. Furthermore if
$(p \mid p) = (q \mid q) \neq 0$, we find, when $x = p$, that $x'$ is $q$. Hence
the transformation turns one given unit vector into another:
although it may be an improper transformation. If so, we
introduce a third vector $r$ such that $(p \mid p) = (q \mid q) = (r \mid r)$,
$(p + r \mid p + r) \neq 0$, $(q + r \mid q + r) \neq 0$, and apply the
corresponding improper transformations $p \to r$, $r \to q$. Then the
product transformation $p \to q$ is necessarily proper. This
auxiliary $r$ is also needed if $p + q = 0$, to cover the case
when $(p + q \mid p + q)$ vanishes and the above $x \to x'$ does
not exist.
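The transformation of Lemma I is easily tried on rational unit vectors. The following sketch uses exact fractions; the vectors `p`, `q` and `x` are sample values, and the function name `reflect` is a hypothetical label, since the map leaves the direction $p + q$ unchanged and reverses every direction orthogonal to it.

```python
from fractions import Fraction

def inner(x, y):
    """Inner product (x | y)."""
    return sum(a * b for a, b in zip(x, y))

def reflect(x, p, q):
    """x'_i = 2 (x | p+q) / (p+q | p+q) * (p_i + q_i) - x_i."""
    s = [a + b for a, b in zip(p, q)]
    norm = inner(s, s)
    if norm == 0:
        # The case p + q = 0, where an auxiliary vector r is required.
        raise ValueError("p + q = 0: an auxiliary vector r is needed")
    c = Fraction(2 * inner(x, s), norm)
    return [c * si - xi for si, xi in zip(s, x)]

p = [Fraction(1), Fraction(0), Fraction(0)]
q = [Fraction(2, 3), Fraction(2, 3), Fraction(1, 3)]
x = [Fraction(3), Fraction(-1), Fraction(2)]

image_of_p = reflect(p, p, q)
image_of_x = reflect(x, p, q)
```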


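Lemma II.

$$\left(\frac{\partial}{\partial q} \,\Big|\, \frac{\partial}{\partial q}\right)(q \mid q)^{\lambda} = 2\lambda(n + 2\lambda - 2)(q \mid q)^{\lambda - 1}.$$

For

$$\frac{\partial}{\partial q_i}(q \mid q)^{\lambda} = \lambda(q \mid q)^{\lambda - 1}\,2q_i,$$

$$\frac{\partial^2}{\partial q_i^2}(q \mid q)^{\lambda} = \lambda(\lambda - 1)(q \mid q)^{\lambda - 2}\,4q_i^2 + 2\lambda(q \mid q)^{\lambda - 1}.$$

Summing for $i = 1, 2, \ldots, n$ the result follows.

Also

$$\left(\frac{\partial}{\partial q} \,\Big|\, \frac{\partial}{\partial q}\right)(q \mid a)(q \mid b) = 2(a \mid b),$$

$$\left(\frac{\partial}{\partial q} \,\Big|\, \frac{\partial}{\partial q}\right)(q \mid k)(ab \ldots hq) = 2(ab \ldots hk),$$

and

$$\left(\frac{\partial}{\partial q} \,\Big|\, \frac{\partial}{\partial q}\right)(q \mid q)(q \mid a) = (2n + 4)(q \mid a),$$

and

$$\left(\frac{\partial}{\partial q} \,\Big|\, \frac{\partial}{\partial q}\right)(q \mid q)(ab \ldots hq) = (2n + 4)(ab \ldots hq).$$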
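The formulae of Lemma II can be confirmed by exact polynomial arithmetic. The sketch below is an illustration under stated assumptions: a polynomial in $q_1, \ldots, q_n$ is stored as a dictionary from exponent tuples to integer coefficients, the operator is applied term by term, and the case $n = 3$ with sample vectors `a`, `b` is taken. All helper names are hypothetical.

```python
from collections import defaultdict

def poly_mul(f, g):
    """Product of two polynomials stored as {exponents: coefficient}."""
    h = defaultdict(int)
    for ea, ca in f.items():
        for eb, cb in g.items():
            h[tuple(x + y for x, y in zip(ea, eb))] += ca * cb
    return {e: c for e, c in h.items() if c != 0}

def laplacian(f, n):
    """Sum of the second derivatives with respect to q_1, ..., q_n."""
    h = defaultdict(int)
    for e, c in f.items():
        for i in range(n):
            if e[i] >= 2:
                d = list(e)
                d[i] -= 2
                h[tuple(d)] += c * e[i] * (e[i] - 1)
    return {e: c for e, c in h.items() if c != 0}

def scale(f, k):
    """Multiply a polynomial by the number k."""
    return {e: k * c for e, c in f.items() if k * c != 0}

def linear(a):
    """The linear form (q | a)."""
    n = len(a)
    return {tuple(int(i == j) for j in range(n)): a[i]
            for i in range(n) if a[i] != 0}

def norm_form(n):
    """The quadratic form (q | q)."""
    return {tuple(2 * int(i == j) for j in range(n)): 1 for i in range(n)}

def power(f, k, n):
    """The k-th power of f, with k a non-negative integer."""
    result = {(0,) * n: 1}
    for _ in range(k):
        result = poly_mul(result, f)
    return result

n, lam = 3, 3
a, b = (1, 2, 3), (4, -1, 2)
qq = norm_form(n)

first = laplacian(power(qq, lam, n), n)
second = laplacian(poly_mul(linear(a), linear(b)), n)
third = laplacian(poly_mul(qq, linear(a)), n)
```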

8. First Fundamental Theorem for Proper Orthogonal Invariants. 

Every polynomial invariant of the proper orthogonal group for
ground forms $f_1, f_2, \ldots$ can be symbolized entirely by the use of
two kinds of factors

$$(ab \ldots hk), \qquad (a \mid b),$$

the outer and inner products respectively.

Proof . — 

This follows by induction. For if $n = 1$, the matrix $M$ is
the scalar unit, and the proper orthogonal transformation is
merely $x = x'$, the identical transformation. Then every
vector is its own outer product and the theorem is obvious. So
we assume it for $m$, and set about proving it for $m + 1 = n$.

Consider the transformation coefficient matrices,

$$M_m = \begin{bmatrix} e_{11} & \ldots & e_{1m} \\ \vdots & & \vdots \\ e_{m1} & \ldots & e_{mm} \end{bmatrix}, \qquad M_0 = \begin{bmatrix} M_m & 0 \\ 0 & 1 \end{bmatrix}.$$

If $M_0$ is orthogonal in the field $n$, so also is $M_m$ for the field
$n - 1$ $(= m)$, as is apparent by forming inner products of each
pair of columns of either. Further if $|M_0| = 1$ so also is $|M_m|$.
Hence they are both properly orthogonal, if either is.

Now $M_0$ corresponds to the transformation which leaves the
vector $q = \{0, 0, \ldots, 0, 1\}$ latent. We note that this is a unit
vector, since

$$(q \mid q) = 1. \qquad (22)$$

Let $\gamma_0$ denote the group of transformations

$$x = M_0x' \qquad (23)$$

which transform the first $m$ components $x_1, x_2, \ldots, x_m$ by means
of the matrix $M_m$, but leave $x_n = x_n'$ latent. Then any invariant
of the full proper orthogonal group is an invariant for its subgroup $\gamma_0$.





Now if we express any polynomial invariant of the given
field as a sum of products, in each of which one factor is a function
solely of the components $a_n, b_n, \ldots$, and the other, $k_i$, a function
of $a_j, b_j, \ldots$ $(j \neq n)$, then each $k_i$
will be an orthogonal invariant in the field of order $m$. Consequently,
by hypothesis, our invariant is a polynomial in two types of
factor, which we write

$$(ab \ldots h)_n = |a_1 b_2 \ldots h_m|, \qquad (a \mid b)_n = a_1b_1 + \ldots + a_mb_m, \qquad (24)$$

together with the third type, $a_n, \ldots$ of suffix $n$.

Also by using the unit vector $q = \{0, 0, \ldots, 0, 1\}$ we can
write these three types as functions of inner and outer products
in the higher field of order $n$; as is at once apparent when each is
fully expanded:

$$(ab \ldots h)_n = \frac{(ab \ldots hq)}{\sqrt{(q \mid q)}}, \qquad (a \mid b)_n = (a \mid b) - \frac{(q \mid a)(q \mid b)}{(q \mid q)}, \qquad a_n = \frac{(q \mid a)}{\sqrt{(q \mid q)}}. \qquad (25)$$

Hence every polynomial invariant of the full group is a polynomial
function of arguments

$$(q \mid q), \quad (q \mid a), \quad (ab \ldots hq), \quad (a \mid b), \quad \ldots \qquad (26)$$

divided by a positive integral power of $\sqrt{(q \mid q)}$.

Now $q/\sqrt{(q \mid q)}$ is a unit vector whatever $q$ may be; and, by
Lemma I, a proper orthogonal transformation exists which
changes any given unit vector into a second. So instead of
the special vector $\{0, 0, \ldots, 0, 1\}$ we can now take $q$ to be any
arbitrary unit vector, so long as, instead of the subgroup $\gamma_0$,
we take the similar subgroup $\gamma_q$ which leaves the vector $q$
latent.

Also, all these five types, as now written, are unchanged by
any proper orthogonal transformation, such as that which
replaces the very special unit vector $q$ by any arbitrary unit
vector $q/\sqrt{(q \mid q)}$, none of whose components vanish. Hence a
typical invariant is given by

$$I = \{G_1 + G_2 \sqrt{(q \mid q)}\} / \{\sqrt{(q \mid q)}\}^k, \qquad (27)$$

where $G_1$ and $G_2$ are polynomials of the same type

$$G\{(q \mid q), (q \mid a), \ldots, (ab \ldots hq), \ldots, (a \mid b), \ldots\},$$

and $k$ is necessarily a positive integer, since one $q$ enters into
every type (26).

Now $G_1$ cannot be zero, else we could cancel out $\sqrt{(q \mid q)}$ and
then treat $G_2$ as a new $G_1$. This being so, $G_2$ must be zero; for
otherwise we could always express $\sqrt{(q \mid q)}$ rationally in terms of
$I$, the components $q_i$, and $a_i, b_i, \ldots$ involved in this equation;
which is impossible even in the case

$$\{0, 0, \ldots, 0, 1, 1\}.$$

Hence $G_2 = 0$, so that $k$ must be even, to make the right and
left sides agree in rationality. Thus we write

$$(q \mid q)^{\lambda} I = G\{(q \mid q), (q \mid a), \ldots, (ab \ldots hq), (a \mid b), \ldots\}, \qquad (28)$$

where $\lambda$ is a positive integer, and each $q_i$ is non-zero.

Operating on both sides of this identity we
find, by Lemma II,

$$(q \mid q)^{\lambda - 1} I = G'\{(q \mid q), \ldots, (ab \ldots hk)\},$$

where $G'$ is of the same type as $G$, but may involve the outer
product $(ab \ldots hk)$ which excludes $q$.

If we proceed $\lambda$ times, this operation annihilates $(q \mid q)$ on the
left, leaving a non-zero multiple of $I$, and consequently all
the $q$'s disappear on the right, leaving only the types $(a \mid b)$,
$(ab \ldots hk)$. This proves the theorem.

9. The Hermitian Transformation with an Absolute Quadric. 

If we apply a general linear transformation $\xi = Mx$, of
matrix $M = [e_{ij}]$, $|M| \neq 0$, to the canonical quadratic

$$(\xi \mid \xi) = \xi_1^2 + \xi_2^2 + \cdots + \xi_n^2, \qquad (29)$$

we obtain a quadratic in $x_1, \ldots, x_n$, which we symbolize by
$r_x^2$. Thus

$$(\xi \mid \xi) = \sum_{i,j,k} e_{ij} e_{ik} x_j x_k = \sum_{j,k} (e_j \mid e_k)\, x_j x_k = r_x^2, \qquad (30)$$

where $(e_j \mid e_k)$ denotes the inner product $e_{1j} e_{1k} + \cdots + e_{nj} e_{nk}$,
and all summations run from 1 to $n$. Hence

$$r_j r_k = (e_j \mid e_k).$$
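
The identity (30) is easy to try on numbers. The sketch below assumes a sample non-singular matrix `M` and a sample set `x`; neither comes from the text.

```python
def inner(u, v):
    """Inner product (u | v)."""
    return sum(p * q for p, q in zip(u, v))

def column(m, j):
    return [row[j] for row in m]

# Sample non-singular matrix (its determinant is -5) and sample variables.
M = [[1.0, 2.0, 0.0],
     [0.0, 1.0, -1.0],
     [3.0, 0.0, 1.0]]
x = [1.0, -2.0, 0.5]
n = len(x)

# xi = M x, then the canonical quadratic (xi | xi).
xi = [sum(M[i][j] * x[j] for j in range(n)) for i in range(n)]
canonical = inner(xi, xi)

# Coefficients r_j r_k = (e_j | e_k): inner products of the columns of M.
R = [[inner(column(M, j), column(M, k)) for k in range(n)] for j in range(n)]
transformed = sum(R[j][k] * x[j] * x[k] for j in range(n) for k in range(n))
```

Here `canonical` and `transformed` both come to 27.5, and `R` is symmetric, as the coefficient array of a quadratic must be.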


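Let us find out what becomes of the last theorem when the
variables $\xi$ which enter an orthogonal transformation $\xi \to \xi'$
now undergo the further transformation $\xi = Mx$. Then, if
$\alpha$, $\beta$ denote linear sets cogredient with $\xi$,

$$(\alpha \mid \beta) = \tfrac{1}{2} \left(\alpha \,\Big|\, \frac{\partial}{\partial \beta}\right) (\beta_1^2 + \beta_2^2 + \cdots + \beta_n^2),$$

from which it follows that $(\alpha \mid \beta)$ is a polar form of $(\alpha \mid \alpha)$. Hence

$$(\alpha \mid \alpha) = r_{\alpha'}^2, \qquad (\alpha \mid \beta) = r_{\alpha'} r_{\beta'}, \qquad (31)$$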


in terms of the corresponding linear sets $\alpha'$, $\beta'$ after transformation.
Now the inner product of two cogredient symbols $\alpha$, $\beta$ is not
a projective invariant, but only arises as an orthogonal invariant.
Here, however, we have expressed it as $r_{\alpha'} r_{\beta'}$, a projective
invariant of linear symbols together with the symbols $r$ of the
quadric. In so doing we link the orthogonal theory with the
projective theory. For if all the ground forms of an orthogonal
system are symbolized, as may be done, entirely by cogredient
symbols $\alpha, \beta, \gamma, \ldots$, then their invariants involve two types only,

$$(\alpha\beta \ldots \mu), \qquad (\alpha \mid \beta), \qquad (32)$$

of which the former is already a projective invariant, giving

$$(\alpha\beta \ldots \mu) = |M|\,(\alpha'\beta' \ldots \mu')$$

when $\alpha \to \alpha'$, $\beta \to \beta'$, $\ldots$, $\mu \to \mu'$.

Furthermore, any non-degenerate quadric may be reduced
to the sum of $n$ squares $\Sigma A_i X_i^2$ by a suitable linear transformation
(§3, p. 300): so that if also $\xi_i = A_i^{1/2} X_i$ $(i = 1, 2, \ldots, n)$,
the resultant transformation $T: x \to \xi$ is still linear. Conversely,
by the inverse transformation $\xi \to x$ we pass from the orthogonal
absolute $(\xi \mid \xi)$
to any given non-degenerate quadric, and thereby we solve the
Hermitian problem (§6, p. 158) of the restricted transformations
$x \to x'$ which leave a given quadric absolutely invariant.


For on performing the transformation $T$, we reduce it to
a proper orthogonal problem. Hence

For ground forms whose symbols $\alpha, \beta, \ldots$ are cogredient with
$x$, every polynomial invariant of the subgroup in which a given
non-degenerate quadric $r_x^2$ is an absolute invariant, can be symbolized
by two types of factor

$$(\alpha\beta \ldots \mu), \qquad r_\alpha r_\beta. \qquad (33)$$

In other words, proper orthogonal invariants of a system of ground
forms $(f)$ are projective invariants of the system $(f, r_x^2)$, obtained
by adjoining the absolute quadric $r_x^2$. Conversely all projective
invariants of $(f, r_x^2)$ are proper orthogonal invariants of $(f)$.

Proof of the Converse . — 

For starting with the projective system whose symbols are
$r, \alpha, \beta, \gamma, \ldots$, the only types we require by the Fundamental
Theorem (§7, p. 203) are

$$(\alpha\beta \ldots \rho), \qquad r_\alpha, \qquad (rs \ldots t),$$

where $r, s, \ldots, t$ are equivalent symbols. But the presence of
$(rs \ldots t)$ in an invariant implies the discriminant $(rs \ldots t)^2$
(§8, p. 194). If this is rejected, any further factors $r_\alpha$ must
occur in pairs $r_\alpha r_\beta$. But $(rs \ldots t)^2$ is $n!$ times the discriminant,
which gives $n!$, a pure
number, in the case when the quadric $r_x^2$ is $(x \mid x)$. Thus the two
types (33) alone are actually necessary, and the theorem is
proved.

EXAMPLES 

1. The improper orthogonal case is obtained by taking

$$r_x^2 = x_1^2 + \cdots + x_{n-1}^2 + x_n^2,$$

and transforming the results of the proper case by the matrix of zeros
with a leading diagonal $1, 1, \ldots, 1, -1$. The required types are $(\alpha\beta \ldots \rho)$,
$(\alpha \mid \beta)$; but the outer product changes sign after improper orthogonal
transformation.

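2. The Lorentz transformation leaves

$$r_x^2 = x_1^2 + x_2^2 + x_3^2 - c^2 x_4^2$$

an absolute invariant, where $c^2$ is a constant.

The invariant types are $(\alpha\beta\gamma\delta)$ together with

$$r_\alpha r_\beta = \alpha_1\beta_1 + \alpha_2\beta_2 + \alpha_3\beta_3 - c^2 \alpha_4\beta_4.$$

This becomes an orthogonal group after the transformation

$$\xi_1 = x_1, \quad \xi_2 = x_2, \quad \xi_3 = x_3, \quad \xi_4 = icx_4.$$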





3. Linear transformations, $T: x \to x'$, satisfying the Lorentz condition
can be constructed by the matrix $(I + SQ)/(I - SQ)$, where



and $S$ is skew symmetric.

4. The binary matrix $(I + SQ)/(I - SQ)$ transforms variables $x$, $t$ to $x'$, $t'$
according to the Lorentz formulae



[Use (6), p. 69.]
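
The construction $(I + SQ)/(I - SQ)$ can be tried in the binary case. The matrix $Q$ is not legible in this copy, so the sketch below ASSUMES $Q = \mathrm{diag}(1, -c^2)$, the coefficient matrix of the binary absolute $x^2 - c^2 t^2$; the sample values of `light` (standing for $c$) and `s` are likewise assumptions.

```python
def matmul(a, b):
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*b)] for row in a]

def inverse2(m):
    """Inverse of a non-singular 2 x 2 matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

light = 3.0   # sample value of the constant c (assumption)
s = 0.1       # the single free element of a binary skew symmetric S

I2 = [[1.0, 0.0], [0.0, 1.0]]
Q = [[1.0, 0.0], [0.0, -light ** 2]]   # ASSUMED form of Q, see lead-in
S = [[0.0, s], [-s, 0.0]]

SQ = matmul(S, Q)
plus = [[I2[i][j] + SQ[i][j] for j in range(2)] for i in range(2)]
minus = [[I2[i][j] - SQ[i][j] for j in range(2)] for i in range(2)]
# Division on the right: T = (I + SQ)(I - SQ)^(-1).
T = matmul(plus, inverse2(minus))

def absolute(v):
    """The binary Lorentz absolute x^2 - c^2 t^2."""
    return v[0] ** 2 - light ** 2 * v[1] ** 2

primed = [2.0, 0.5]   # sample values of x', t'
unprimed = [sum(T[i][j] * primed[j] for j in range(2)) for i in range(2)]
```

Under this assumption the absolute takes the value 1.75 on both `primed` and `unprimed`, so $T$ leaves the quadratic absolutely invariant.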

5. If $a, b, c$ are contragredient to $\alpha, \beta, \gamma$, the outer product types $(abc)$,
$(ab\gamma)$, $(a\beta\gamma)$, $(\alpha\beta\gamma)$ are all orthogonal invariants of ternary forms. Expressed
as projective invariants of the absolute $r_x^2$, the second and third of these
must be modified. Thus if $(\alpha \mid \beta) = r_\alpha r_\beta$ then we can prove that

$$(ab\gamma) = (abr)\, r_\gamma, \qquad (a\beta\gamma) = (ars)\, r_\beta s_\gamma.$$

For if $f(x) = r_x^2 = x_1^2 + x_2^2 + x_3^2$, then

$$\tfrac{1}{2}(ars)(rs \mid \beta\gamma) = \tfrac{1}{2}(ars)(r_\beta s_\gamma - r_\gamma s_\beta) = (a\beta\gamma).$$

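6. If $r_x^2 = s_x^2 = x_1^2 + x_2^2 + x_3^2 + x_4^2$, then $(abrs)(\alpha\beta \mid rs)$ is
equivalent to $2(ab\alpha\beta)$.

[Here $(\Sigma (ab)_{ij}(rs)_{kl})(\Sigma (\alpha\beta)_{ij}(rs)_{ij}) = \Sigma (ab)_{ij}(\alpha\beta)_{kl}(rs)_{kl}(rs)_{kl}$, since

$$r_i r_k = 0, \quad (i \neq k).]$$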

7. Prove, by a Laplace development, that when

$$r_x^2 = x_1^2 + x_2^2 + \cdots + x_n^2,$$

then the concomitant

$$(ab \ldots h\, r^{(1)} r^{(2)} \ldots r^{(n-p)})\,(\alpha\beta \ldots \delta \mid r^{(1)} r^{(2)} \ldots r^{(n-p)}),$$

involving $p$ linear forms $a_x, \ldots, h_x$; $n - p$ linear forms $u_\alpha, u_\beta, \ldots, u_\delta$,
and $n - p$ equivalent symbols $r^{(1)}, \ldots, r^{(n-p)}$, may be replaced by the outer
product

$$(ab \ldots h\, \alpha\beta \ldots \delta)$$

to a numerical factor.


10. Geometrical Significance of the Adjunction Theorem. 

The preceding theorems obviously have something in common:
they link affine, orthogonal and Hermitian invariants with
projective invariants by adjoining to given ground forms $f$, one or
more forms $\phi$ which are latent for the linear transformations of
their respective groups. For this reason they are called Adjunction
Theorems.

If we turn to the geometrical aspect of the theorems we find
matter of high interest in metrical geometry. It is well known
that properties of distance and size of angle, holding for plane
Euclidean geometry, may be interpreted projectively by stating
them as cross ratio properties of a figure to which the circular
points at infinity are adjoined.

Thus, for example, if $I$, $J$ are the circular points, and $P$ is
a point not on $IJ$, then the pencil $P\{QR, IJ\}$ of four lines through
$P$ is harmonic whenever $PQ$, $PR$ are at right angles.

Analytically, in rectangular Cartesian co-ordinates, the matter
is clear if $ax^2 + 2hxy + by^2 = 0$, $x^2 + y^2 = 0$ respectively denote
the pairs of lines $PQ$, $PR$; $PI$, $PJ$. For $a + b = 0$ provided that
$RPQ$ is a right angle, or, equally well, if $P\{QR, IJ\} = -1$.
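
The harmonic property can be checked with complex slopes. In this sketch each line through $P$ is represented by its slope $y/x$, so that the lines $PI$, $PJ$ of $x^2 + y^2 = 0$ have slopes $\pm i$; the helper names and the sample coefficients are assumptions made for illustration.

```python
import cmath

def cross_ratio(m1, m2, m3, m4):
    """Cross ratio of four lines of a pencil, given by their slopes."""
    return ((m1 - m3) * (m2 - m4)) / ((m1 - m4) * (m2 - m3))

def line_pair_slopes(a, h, b):
    """Slopes m = y/x of a x^2 + 2 h x y + b y^2 = 0, assuming b != 0.

    Dividing the equation by x^2 gives b m^2 + 2 h m + a = 0.
    """
    root = cmath.sqrt(h * h - a * b)
    return (-h + root) / b, (-h - root) / b

circular = (1j, -1j)   # slopes of PI, PJ

# A pair with a + b = 0 (sample coefficients): the lines are at right angles.
m1, m2 = line_pair_slopes(2.0, 1.5, -2.0)
ratio_right = cross_ratio(m1, m2, *circular)

# A pair with a + b != 0: the lines are not at right angles.
n1, n2 = line_pair_slopes(1.0, 0.0, -4.0)
ratio_oblique = cross_ratio(n1, n2, *circular)
```

The first pair has slopes $-\tfrac12$ and $2$, and its cross ratio with the circular lines is $-1$; the second pair, with slopes $\pm\tfrac12$, gives a cross ratio different from $-1$.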

By taking a general conic or quadric, $r_x^2$, as latent, we
interpret non-Euclidean elliptic or hyperbolic geometry. If
the latent form is $u_1^2 + u_2^2$, which is a degenerate conic, we can interpret
Euclidean metrical plane geometry, by combining the theorems
of §4 and §8. Thus if $m = 2$, $n = 3$, and the matrix of
§1 (1) is orthogonal, then the required result is secured.

EXAMPLE 

Thus in ternary symbols, let $l_x = 0$ denote the equation of the line at
infinity, and $u_\omega^2 = 0$ that of the circular points, so that $u_\omega^2$ can be factorized,
say

$$u_\omega^2 = u_\lambda u_\mu.$$

Then $\{\lambda_1, \lambda_2, \lambda_3\}$ and $\{\mu_1, \mu_2, \mu_3\}$ are the homogeneous co-ordinates of these
points $I$ and $J$.

Furthermore, if $f = a_x^2$ denotes a conic whose tangential equation is
$u_\alpha^2 = 0$, then a covariant conic exists for the quadratics $u_\omega^2$, $u_\alpha^2$, namely

$$\phi = (\alpha\omega x)^2 = 0.$$

By the Clebsch principle this gives the locus of a point $x$ whose tangents
to these conics form a harmonic pencil. Hence in Euclidean geometry it
is the locus of a point whose tangents to a conic are at right angles. In other
words the conic $\phi$ is the director circle of the conic $f$.

If this $(\alpha\omega x)^2$ is written down non-symbolically, whether in Cartesian
or homogeneous co-ordinates, the ordinary results will be obtained.

Such examples could easily be multiplied, and indeed they 
form a very attractive analytical projective geometry which has 
received comparatively little attention. 

11. Remarks on the Adjunction Theorem. 

It is very tempting to try to discover a general Adjunction
Theorem to cover the case when any one or more given ground
forms $\phi_1, \phi_2, \ldots, \phi_r$ are latent for a linear transformation $T$.
For if $(f)$ means the complete projective system of concomitants
of a set of ground forms $f$, and $(f, \phi)$ means that of the whole set
of forms $f$ and $\phi$, then any member of $(f, \phi)$ is certainly an
invariant of any transformation which leaves each $\phi_i$ latent.
But except for a few cases, detailed above, when $\phi_i$ is linear or
quadratic, the converse is not true. Nor has any general law been
found to determine restricted transformations for a given set of
latent forms $\phi$. How this converse applies is still an unsolved
problem of the theory.¹

It is possible to extend these methods of Study and Weitzenböck
to the case when a bilinear form is latent, and to the theory
of double binary and other multiple fields (p. 240); but in all
probability the most useful aspects of further work along these
lines are to be sought in particular applications to ground forms.

That this Adjunction Theorem breaks down for forms higher
than the quadratic is perhaps one of the most remarkable facts
of mathematics. It makes one wonder what would have been the
history of geometry and natural philosophy, had the cubic or
higher form been a possible absolute on which to base our metrical
results. For never in the age-long story of measurement, from
the discoveries of Pythagoras, about 500 B.C., to present-day
speculations, has the geometer or physicist renounced the
quadratic as his basis of measurement. The quadratic is one
of the things which seem to have come to stay. The theorem

¹ Weitzenböck, Enc. Math. Wiss., III, 3, 6 (1922), p. 20. Burchardt,
Math. Annalen, 43 (1893), 197-216.




concerning the squares on the sides $x_1, x_2, r$ of a right-angled
triangle, we can write as

$$r^2 = x_1^2 + x_2^2 = (x \mid x);$$

but the latest speculations in general relativity would throw this
theorem into its infinitesimal shape

$$ds^2 = dx_1^2 + dx_2^2 = (dx \mid dx)$$

as a special case of a universal formula

$$ds^2 = \Sigma g_{ik}\, dx_i\, dx_k = g_{dx}^2$$

in $n$ variables $x_1, \ldots, x_n$.

And what again would have happened if the absolute had
been not even quadratic but only linear?

Why it is that the quadratic form should occupy this 
privileged position between linear and higher orders might well 
raise questions of considerable philosophic interest. 

12. Connexion between Differential and Projective Invariants. 

It may be wondered why there has hitherto been such preoccupation
with the linear transformation, which after all is only
a very special case of what in general can be written

$$T: \quad x_i = f_i(x_1', x_2', \ldots, x_n') = x_i(x_1', x_2', \ldots, x_n'), \quad (i = 1, 2, \ldots, n), \qquad (34)$$

where a set of independent variables $x_i$ is transformed into a new
set $x_i'$, by definite functional relations not necessarily
linear. Now the reason lies in the general difficulty of treating
anything more elaborate. The linear stands in relation to the
general transformation $T$, much as linear differential equations
do to the general theory of differential equations, or in kinematics
as the velocity of a particle to a finite displacement. The latter
may disclose quite an unworkable problem to which the former
contributes a satisfactory first approximation.

Assuming each $x_i$ to be a regular function of each $x_j'$, and
vice versa, we can write

$$dx_i = \sum_j \frac{\partial x_i}{\partial x_j'}\, dx_j'$$

for the first differential of each $x_i$ in terms of those of $x_j'$. But
this is patently a linear transformation from the set

$$\{dx_1, dx_2, \ldots, dx_n\}$$

to $\{dx_1', dx_2', \ldots, dx_n'\}$. Let us denote these sets by $dx$, $dx'$
respectively. Then if $c_1, \ldots, c_n$ are given functions of $x_1, \ldots, x_n$,
we can express each $c_i$ as a function of $x_1', \ldots, x_n'$ and any
linear form $\Sigma c_i\, dx_i$ as a linear form in $dx_i'$. We write

$$C = \Sigma c_i\, dx_i = (c \mid dx) = (c' \mid dx'),$$

where a new set of functions $c_i'$ is derived as coefficients of $dx_i'$
in $C$. But this is precisely the theory of contragredience over
again, and we can accordingly speak of the set of functions

$$c = [c_1, c_2, \ldots, c_n]$$

as contragredient to the set $dx$.
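
A short numerical sketch may make the contragredience concrete. It assumes, purely for illustration, the non-linear transformation to plane polar co-ordinates, $x_1 = \rho \cos t$, $x_2 = \rho \sin t$, with $x' = (\rho, t)$, and a hypothetical pair of coefficient functions $c_1 = x_1 x_2$, $c_2 = x_1 + x_2$.

```python
import math

rho, t = 2.0, 0.6   # sample point in the primed variables (assumption)

# Matrix of the induced linear transformation dx = M dx',
# with elements dx_i / dx_j' for x1 = rho cos t, x2 = rho sin t.
M = [[math.cos(t), -rho * math.sin(t)],
     [math.sin(t), rho * math.cos(t)]]

dx_primed = [0.01, -0.02]   # sample differentials (d rho, d t)
dx = [sum(M[i][j] * dx_primed[j] for j in range(2)) for i in range(2)]

# Hypothetical coefficient functions c_i evaluated at the same point.
x1, x2 = rho * math.cos(t), rho * math.sin(t)
c = [x1 * x2, x1 + x2]

# Contragredient transformation: c' = c M, with c treated as a row.
c_primed = [sum(c[i] * M[i][j] for i in range(2)) for j in range(2)]

form_unprimed = sum(ci * di for ci, di in zip(c, dx))              # (c | dx)
form_primed = sum(ci * di for ci, di in zip(c_primed, dx_primed))  # (c' | dx')
determinant = M[0][0] * M[1][1] - M[0][1] * M[1][0]
```

The two values of the linear form agree, and the determinant of $M$ equals $\rho$, so the induced transformation is non-singular away from the origin.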

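Example.

Writing in matrix notation $dx = M\,dx'$, $c = c'M^{-1}$, then

$$M = \left[\frac{\partial x_i}{\partial x_j'}\right],$$

where $|M|$ is the Jacobian $\partial(x_1, \ldots, x_n)/\partial(x_1', \ldots, x_n')$.

Thus:

Arising out of a general transformation $x \to x'$ is a linear
transformation $dx \to dx'$ whose matrix $M = [\partial x_i/\partial x_j']$ is non-singular,
together with a contragredient transformation $c \to c'$ for the set
$c$ of coefficients of a linear differential form $\Sigma c_i\, dx_i$.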

In particular let $f = f(x) = f'(x')$ denote a given function
expressed first explicitly in terms of the $x_i$'s and secondly in
terms of the $x_i'$'s. Then its total differential can be written as

$$df = \sum_i \frac{\partial f}{\partial x_i}\, dx_i = \sum_i \frac{\partial f'}{\partial x_i'}\, dx_i' = df'.$$

So, whatever function $f$ is taken, the set

$$\left[\frac{\partial f}{\partial x_1}, \frac{\partial f}{\partial x_2}, \ldots, \frac{\partial f}{\partial x_n}\right]$$

is contragredient to $[dx_1, dx_2, \ldots, dx_n]$.

Once this algebraic idea is grasped, that an inner product
$(c \mid dx)$ is an invariant for contragredient sets, algebraic or
differential, it throws light on numerous branches of geometry
and physics, bringing them under the rubric of one mathematical
doctrine. Thus it appears that

Binary forms illustrate the differential geometry on a surface.
Ternary forms illustrate that of pre-relativity physics.
Quaternary forms illustrate the present era of physics.

EXAMPLES 

1. The well-known formula

$$ds^2 = dx^2 + dy^2$$

for the square of the element of arc of a plane curve in terms of differentials
in rectangular co-ordinates $x$, $y$, can be looked upon as a binary quadratic
ground form in homogeneous variables $[dx, dy]$.

2. The analogous ternary formula

$$ds^2 = dx_1^2 + dx_2^2 + dx_3^2$$

for the arc of a space curve is again a quadratic. Then if a linear
transformation $dx \to dx'$ leaves this absolutely invariant, we have another
example of orthogonal transformation. In this case

$$ds^2 = (dx \mid dx) = (dx' \mid dx') = ds'^2, \qquad dx = M\,dx',$$

and $M$ is orthogonal.

3. Or, again, the potential function $V$ of three variables $x_1, x_2, x_3$ leads
to a set

$$\left[\frac{\partial V}{\partial x_1}, \frac{\partial V}{\partial x_2}, \frac{\partial V}{\partial x_3}\right]$$

contragredient to $(dx_1, dx_2, dx_3)$.

A full account of this differential theory can be read in many
recent publications.¹ But here it may be useful to refer to
the likenesses and the contrasts between the algebraic and the

¹ An excellent introduction is given in the Cambridge Tract No. 24 (1927),
Veblen, Invariants of Quadratic Differential Forms. A larger work is the English
translation of an Italian work: Levi-Civita, The Absolute Differential Calculus
(Blackie, 1926). Cf. Weitzenböck, Invariantentheorie, Chap. XIII.




differential theories. Both contain ground forms, linear transformations,
and concomitants relative or absolute, though they
are somewhat disguised by having different names. Certain
processes, too, can be recognized as identical. But in the
differential theory emphasis is placed on the Tensor (pp. 91,
200), or set of coefficients $a^{rst\ldots}_{ijk\ldots}$ of a multilinear form, where
$ijk\ldots$ are called indices of covariance and $rst\ldots$ indices of
contravariance. Thus a set $a_i$ is called a covariant vector and
$a^r$, a contravariant vector.

It should be carefully noted that this use of the words
covariant and contravariant is quite different from their use
in algebra.

From the algebraic point of view the most interesting fact
of the differential theory of forms is the existence of a Reduction
Theorem first discovered by Christoffel,¹ whereby the problem
in differential invariant theory of tensors and their derivatives
up to a given order, is identical with that of the projective
invariant theory. It is noteworthy that from a physical point of
view the most important algebraic forms are the linear $u_\alpha$,
the linear complex $(ab \mid xy)$, the quadratic $a_x^2$ and
the quadratic complex $(B \mid xy)^2$. These quadratics
figure prominently as the differential form $ds^2 = \Sigma g_{ik}\, dx_i\, dx_k$
and the Riemann-Christoffel curvature tensor.

13. Prepared Systems. 

Although the general theory of binary forms is fairly complete,
little is known of higher categories beyond the irreducible system
of a ternary cubic and certain linear or quadric systems. The
fundamental theorem works very well for ternary forms, because
$(abc)$, $a_\alpha$, $(\alpha\beta\gamma)$ are the only types of symbolic factor which may
arise even if compound co-ordinates are utilized (§11, p. 210).
Quaternary forms require implicit convolution (§11, p. 211) and
thereby provide great complications. How, for instance, do the
symbols of a linear complex $(aa' \mid xy)$ fit in with the types $(abcd)$,
$(\alpha\beta\gamma\delta)$? This has suggested the problem to supplement the
three fundamental symbolic types by further types so as to render
all convolution explicit. To such systems of symbolic types
the name Prepared Systems has been given.


¹ Crelle, 70 (1869), 46-70.

The prepared system for quaternary forms¹ consists of thirteen
types:

$(abcd)$, $(a\beta)$, $(\alpha\beta\gamma\delta)$, $(aAb)$, $(\alpha A\beta)$, $(AB)$, $(aAB\alpha)$,
$(aABCb)$, $(\alpha ABC\beta)$, $(aABCD\alpha)$,
$(aABCDEb)$, $(\alpha ABCDE\beta)$, and $(ABCDEF)$.

Here capital letters have currency two (§6, p. 37); and if $A = a'a''$,
$B = b'b''$, &c., then $(aABCb)$ is defined as

$$(aAb')(b''Cb) - (aAb'')(b'Cb)$$

with analogous definitions for the others. A prepared system
in general gives the complete system for all possible linear ground
forms $a_x, \ldots, (A_2 \mid x_2), \ldots$. At present
nothing is known beyond the quaternary case.

14. Quantitative Substitutional Analysis. 

Determinant, matrix, symbolic invariant, tensor, and group
theory are but variations on one theme: permutations and
combinations. Here algebra begins and here it appears to stay.
But how many of us have ever thought it worth while to study
the very ABC of substitutional processes? or have even inquired
if they have an ABC?

Suppose, for instance, it is known that a certain function
$f(x, y, z, u, v)$ of five arguments is symmetrical in $x, y$, skew
symmetrical in $y, z, u$ and also in $x, v$: then what are its characteristic
properties? Is there a calculus behind this kind of inquiry which
will obviate the necessity of examining every case for itself as
it arises? There is. About thirty years ago Frobenius² and
Young³ appear to have made independent discoveries which
lead to a systematic calculus of substitutions. Their work links
these questions with the theory of matrix equations. By so
doing, it gives a kind of canonical form to whole groups of
substitutional properties. The bare fact that the natural sequence
in this algebra is not of the order 1, 2, 3, ..., but rather is that

¹ Proc. London Math. Soc., 2, 21 (1923), 381-8, and 2, 25 (1926), 303-327.

² "Über die Darstellung der endlichen Gruppen durch lineare Substitutionen",
Berliner Sitzungsberichte, 1 (1897), 2 (1899).

³ Proc. London Math. Soc., 1, 33 (1901), 34 (1903); 2 ... (1928), and Journal
3 (1928), "On Quantitative Substitutional Analysis".





of 1!, 2!, 3!, ... shows where the practical difficulties lie. Napier
was prompted to invent logarithms solely by the difficulty of
computing long multiplication sums. Can a like benefit, it may
be asked, be found for algebra, and have these pioneers brought
it within sight?


MISCELLANEOUS EXAMPLES 

1. If capital letters denote square matrices of order $n$, $I$ being the
unit matrix, and if small letters denote scalar numbers, prove that

$$(I + pAB)A(I + qBA) = (I + qAB)A(I + pBA).$$

2. Prove that $(I + pAB)A(I + qBA)B(I + rAB)$ is symmetrical in
$p, q, r$.

3. If $A : B$ means $AB^{-1}$, prove $A : B = AX : BX$, if $|BX| \neq 0$.

4. If $\dfrac{A}{B +}\,\dfrac{C}{D +}\cdots$ has the usual meaning of a continued fraction,
provided that division is always performed on the right, investigate the
law of successive convergents, $P/Q$.

[With proper safeguards, the usual scalar law is true. If $A = C = \ldots = I$,
$B = A_1$, $D = A_2$, $\ldots$, then $P_n = P_{n-1}A_n + P_{n-2}$, $Q_n = Q_{n-1}A_n + Q_{n-2}$.]

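5. Given

$$A = \begin{bmatrix} 4 & -1 & 0 \\ -1 & 3 & 2 \\ 5 & 7 & 6 \end{bmatrix} \quad \text{and} \quad B = \begin{bmatrix} -1 & 12 & 5 \\ 12 & 11 & 9 \\ 34 & 57 & 37 \end{bmatrix},$$

find the most general matrix $X$, satisfying the equation $AX = B$.

(Edinburgh.)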
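
One way to organise the answer to Example 5 is sketched below. The matrix $A$ is singular of rank 2, so a solution, if it exists, is determined only up to multiples of a latent column $n$ with $An = 0$. The particular solution `X0` and the column `null_vector` were found by hand and are offered as an assumption to be verified, not as the intended method of the exercise.

```python
def matmul(a, b):
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*b)] for row in a]

def det3(m):
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

A = [[4, -1, 0], [-1, 3, 2], [5, 7, 6]]
B = [[-1, 12, 5], [12, 11, 9], [34, 57, 37]]

# One particular solution and a column annihilated by A (both found by hand).
X0 = [[1, 3, 2], [5, 0, 3], [-1, 7, 1]]
null_vector = [2, 8, -11]

def general_solution(k):
    """X0 plus the column null_vector times an arbitrary row k = [k1, k2, k3]."""
    return [[X0[i][j] + null_vector[i] * k[j] for j in range(3)] for i in range(3)]

annihilated = [sum(A[i][j] * null_vector[j] for j in range(3)) for i in range(3)]
sample = general_solution([3, -1, 5])
```

Since the leading minor of order two in $A$ is $11 \neq 0$ while $|A| = 0$, the latent column is unique up to a scalar, which is why three arbitrary constants `k` suffice.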


6. Show that A — 



3-1 

7 

2 

0 - 


satisfies the equation 


A^ 4* 83i4 == 0. Obtain the Cayley-Hamiltonian equation (§2, p. 99) for 
A, and discuss the connexion between the two equations. 

(Edinburgh,) 

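7. If $\theta$, $\phi$, $\psi$ are arbitrary then the matrix

$$\begin{bmatrix}
\cos\phi\cos\theta\cos\psi - \sin\phi\sin\psi & \sin\phi\cos\theta\cos\psi + \cos\phi\sin\psi & -\sin\theta\cos\psi \\
-\cos\phi\cos\theta\sin\psi - \sin\phi\cos\psi & -\sin\phi\cos\theta\sin\psi + \cos\phi\cos\psi & \sin\theta\sin\psi \\
\cos\phi\sin\theta & \sin\phi\sin\theta & \cos\theta
\end{bmatrix}$$

is orthogonal. Express it in the form $(I - S)(I + S)^{-1}$ where $S$ is skew
symmetric.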
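
A numerical sketch of Example 7 follows. It assumes the signs of the third column as they are required by orthogonality, takes arbitrary sample angles, and recovers $S$ from the relation $S = (I - R)(I + R)^{-1}$, which is valid whenever $I + R$ is non-singular.

```python
import math

def matmul(a, b):
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def inverse3(m):
    """Inverse of a non-singular 3 x 3 matrix by the adjugate."""
    (a, b, c), (d, e, f), (g, h, i) = m
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    adj = [[e * i - f * h, c * h - b * i, b * f - c * e],
           [f * g - d * i, a * i - c * g, c * d - a * f],
           [d * h - e * g, b * g - a * h, a * e - b * d]]
    return [[entry / det for entry in row] for row in adj]

def combine(a, b, sign):
    return [[a[i][j] + sign * b[i][j] for j in range(3)] for i in range(3)]

def close(a, b, tol=1e-12):
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(3) for j in range(3))

def euler_matrix(theta, phi, psi):
    ct, st = math.cos(theta), math.sin(theta)
    cp, sp = math.cos(phi), math.sin(phi)
    cs, ss = math.cos(psi), math.sin(psi)
    return [[cp * ct * cs - sp * ss, sp * ct * cs + cp * ss, -st * cs],
            [-cp * ct * ss - sp * cs, -sp * ct * ss + cp * cs, st * ss],
            [cp * st, sp * st, ct]]

I3 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
R = euler_matrix(0.4, 0.9, -0.3)   # sample angles (assumption)

# Skew symmetric S with R = (I - S)(I + S)^(-1).
S = matmul(combine(I3, R, -1), inverse3(combine(I3, R, +1)))
rebuilt = matmul(combine(I3, S, -1), inverse3(combine(I3, S, +1)))
zero = [[0.0] * 3 for _ in range(3)]
```

The check fails only for those special angles at which $R$ has the latent root $-1$, when $I + R$ has no reciprocal.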





8. Prove that



-11 . " 


- 1-1 . “ 


-2 . .- 

.4 = 

. 1 . 

. 5 = 

. 1 . 

. • • 1 - 

, (7 = 

. 2 . 

• 1 - 


are commutative, and that the functions $A + B + C$, $BC + CA + AB$,
$ABC$ are scalar, and equal to the corresponding functions of the latent
roots of $A$. (Edinburgh.)


9. Prove that if -4 = 


rl -1 
4 -3 
6 -3 
4 -1 
-1 . 


1 -1 
2 -1 
1 


1-1 


, then = /, and 


generalize the theorem. Prove that the characteristic equation satisfied 
by A is (z^ — 1)* (z — 1)“ ^ = 0. 


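10. If $n, m, k$ are positive integers, and a function $\omega$ satisfies the relation

$$\omega(n, m) + \omega(m, k) = \omega(n, k),$$

show that $\omega(n, m)$ is the $(n, m)$th element in a skew symmetric matrix.

(Heisenberg.)

11. If $A_\alpha = \begin{bmatrix} \cos\alpha & \sin\alpha \\ -\sin\alpha & \cos\alpha \end{bmatrix}$, prove $A_\alpha^n = \begin{bmatrix} \cos n\alpha & \sin n\alpha \\ -\sin n\alpha & \cos n\alpha \end{bmatrix}$, and
give a geometrical explanation.

If $A_\beta$ is a similar matrix for an angle $\beta$, prove $A_\alpha$, $A_\beta$ commute and that

$$A_\alpha A_\beta = A_{\alpha + \beta}.$$

12. If $A = \begin{bmatrix} a & b & c \\ \cdot & a & b \\ \cdot & \cdot & a \end{bmatrix}$, $P = \begin{bmatrix} p & q & r \\ \cdot & p & q \\ \cdot & \cdot & p \end{bmatrix}$, then $AP$, $PA$ both have
the same form, with constant values throughout the parallels to the leading
diagonal, and zeros below.

Generalize this feature.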


13. If $A$, $B$ are reduced to normal form $PLP^{-1}$, $QMQ^{-1}$ where $L$ and
$M$ are diagonal matrices both with $n$ distinct latent roots $\lambda_1, \lambda_2, \ldots, \lambda_n$
and $\mu_1, \mu_2, \ldots, \mu_n$, prove that a non-zero matrix $X$ can be found to satisfy

$$AX = XB$$

if and only if $A$ and $B$ have at least one latent root in common. In this
case $X$ is called a commutant of $A$ and $B$.

[Let $Y = P^{-1}XQ$; then $LY = YM$; and equate corresponding
elements.]

14. If $AX = XB$, prove that $X$ is a commutant of $f(A)$ and $f(B)$.

[First prove $A^n X = XB^n$.]

15. Desargues' Theorem. In ternary forms, let $a, b, c, a', b', c'$ denote
coplanar lines. Show that the equation

$$p_x = (b'c'b)\,c_x - (b'c'c)\,b_x = 0$$

represents the line joining the intersections of $b$, $c$ and of $b'$, $c'$. If $q_x$, $r_x$
denote similar expressions for $c$, $a$ and $a$, $b$ prove that

$$(pqr) = -(abc)(a'b'c')(aa' . bb' . cc')$$

where

$$(aa' . bb' . cc') = (abb')(a'cc') - (a'bb')(acc').$$

[Write $p = \lambda c - \mu b$, $q = \lambda' a - \mu' c$, $r = \lambda'' b - \mu'' a$, where $\lambda = (b'c'b)$,
&c., are scalar. Then $(pqr) = \lambda\lambda'\lambda''(cab) - \mu\mu'\mu''(bca)$, all other terms
vanishing. The result follows by using the identities

$$(b'c'b)(c'a'c) = (b'c'a')(c'bc) + (b'c'c)(c'a'b),$$

$$(c'a'a)(a'b'b) = (c'a'b')(a'ab) + (c'a'b)(a'b'a).$$

State the geometrical result, and the dual result involving points $\alpha, \beta, \gamma$,
$\alpha', \beta', \gamma'$.]

16. Prove the identity

$$(ax . by . cz) + (ay . bz . cx) + (az . bx . cy) = 0$$

between any six coplanar points $a, b, c, x, y, z$. The symbols are cogredient
and equally well can represent lines.

Pappus' Theorem. For six coplanar points $A, B, C, X, Y, Z$, if $AX$,
$BY$, $CZ$ meet in a point, and $AY$, $BZ$, $CX$ meet in a point, prove that
$AZ$, $BX$, $CY$ do so.

[Expand each compound determinant.]

17. If $(\xi ax . by . cz\xi) = (\xi axb)(ycz\xi) - (\xi axy)(bcz\xi)$ where the symbols
are seven cogredient points in the quaternary field, prove that this expression
equated to zero is the equation in $\xi$ of the quadric surface containing
the lines $ax$, $by$, $cz$ as generators.

[The point $\lambda a + \mu x$ lies on the surface, &c.]

18. Prove the identity

$$(\xi ax . by . cz\xi) + (\xi ay . bz . cx\xi) + (\xi az . bx . cy\xi) = 0.$$

If $ABCXYZ$ is a given skew hexagon in space, prove that the quadric
surface containing $AX$, $BY$, $CZ$ as generators, and that containing $AY$,
$BZ$, $CX$ as generators, and that containing $AZ$, $BX$, $CY$ as generators are
linearly related and have a common curve of intersection.




INDEX 


Absolute, 320. 

Absolute invariants, 206-7, 275. 

Addition of matrices, 34, 70. 

Adjacent terms, 15, 251. 

Adjugate, 67, 104. 
compound, 87. 

Jacobi’s theorem on the, 79. 

Adjunction theorem, 323. 
for affine group, 310. 
for orthogonal group, 316. 

Affine group, 162, 309. 
invariants, 227, 310. 
symbolic expression for, 310. 
transformation, 309. 
with fixed point, 162, 315. 

Aitken, 20, 108. 

Algebra, definition of, 281. 

Algebraic complement, 26. 

Algebraically complete system, 231, 234. 

Alternant, 28. 

Alternation. See Determinantal permutation. 

Analysis, 281, 326. 

Anharmonic ratio. See Cross ratio.

Annihilators, 227. 

Apolarity, 262-9, 278. 
defined, 264. 

Argand diagram. See Gauss plane. 

Arithmetic, 281, 

Aronhold, 141, 174, 179. 

Aronhold operator, 140, 180, 207, 260, 302. 

Associative law, 57, 61. 

Ausdehnungslehre, 58. 

Axial co-ordinates, 87. 

Basis theorem of Hilbert, 235. 

Battaglini complex, 289. 

Bazin, 55, 56, 108, 225. 

Bell, 177, 181. 

Bernoulli, 13. 

Bezout, 46. 

Bilinear forms, 108, 294. 
binary and higher, 286. 
geometrical interpretation of, 283-7. 
latent in transformation, 159. 

Binary field, 10, 41, 50, 59, 128-46, 151,
173, 177-81, 191, 213, 215-33, 244-7,
263, 266, 269, 283-6.
double, 243. 

Bôcher, 160, 296, 304.

Boole, 128, 139. 

Bromwich, 304. 

Burchardt, 325. 

Canonical forms, 265. 
binary, 219, 244, 246. 
of general quadric, 300, 302, 

Canonical matrix, 296. 

Capelli, 1 12, ij6, 253. 
operator, 1x2, 256. 

Cartesian co-ordinates, 3, 5, 128-32, ISS* 
163, 178, 280, 283, 310, 325, 328. 


Cauchy, 55, 56, 67, 87, 108, 165. 

Cayley, 2, 4, 5Q. 132. I33. ISS, 156. 174 
227, 234, 296. 

Hamilton theorem, 99, 296, 331. 
operator, 113-S. 122, 123, 188, 203, 2Xi. 

Characteristic of determinants, 55. 
equation, 98, 107, 292. 

Chinese, 6. 

Ciamberlini, 307. 

Circle, 243. 

Class C+, C~, 15. 

Clebsch, 140, 172, 174, 179, 2X0, 247, 248, 
2S5> 258, 260. 
theorem of, 248. 

transference principle, 287, 299, 316. 

Cogredient, transformation defined, 149. 
sets of variables, forms with, 199, 270, 

Collineation, 291, 295. 

Combinants, 242. 

Commutant, 332. 

Commutative law, 57, 61, 110, 257. 
matrices, 71. 

Complement, algebraic, 26. 

Complete systems, 243. 
algebraically, 231. 
of binary cubic, 244. 
of binary quartic, 245. 
of binary various, 246. 
of general quadric, 297. 
two general quadric, 304-6, 

Complex, linear, 212, 329. 
quadratic, 2^8. 
quaternary line, 329. 

Complex variable, 243. 

Compound co-ordinates, 85, 86, 209, 249. 
determinants, 49, 87. 
inner product, 81, 83, 209. 
transformation, X63, 165. 

Cone, 299* 

Conformable matrices, 34. 

Conic as binary form, 284. 

as ternary, 178, 286, 289, 298, 302. 
polar, X78, 286. 

Conjugate matrices, 6. 
points, 287. 289. 
primes, 287. 

Continued fractional matrix, 331. 

Contracted functional notation, 20. 

Contragredience, defined, X49. 
and correlation, 295. 
fundamental property of, 149. 
of point and prime, 150, 151. 

Contra variant, 206, 329. 

Convolution, implicit, explicit, 46, 225, 253. 
and resolution, 207. 
double, 193. 

Co-ordinates. See Cartesian^ compound, 
homogeneous. 5, 86, 151, 2x2, 265, 283- 
308, 31S. 325- 

Correlation, 295. 

Correspondence, 280, 281, 291, 




Corresponding matrices, theorem of, 79, 116.

Covariants of binary forms, 143. 
as invariants of linear forms, 145, 207. 
defined, 14^. 

in the relativity theory, 329. 
of degree two, 230, 274. 
of general forms, 206. 

Cramer, 13. 

Cross ratio, 283. 

Cubic, binary, 244, 266. 
syzygy, *44- 
twisted, 286. 

Cullis, 296. 

Currency, defined, 37. 

Degree defined, 134, 172. 

Derangement, 13. 

Desargues, 332. 

Determinant. See Adjugate, Alternant^ 
Compound. 
bordered, 101, 299. 
characteristic, 55. 
definition of, 17. 
differentiated, 110, 123. 
duality of, 51, 89, 92. 
expansion of, 19, 98. 
extensional, 42-9, 287-9. 
functional. See Jacobian.
irresoluble, 33. 
logarithm of, 124. 
multiplication of 65. 
notation of, i, 27, 37. 
reciprocal, 66, 89, 103. 
skew symmetric, 105. 
symmetric, 104. 

Determinantal permutation, 27, 38, 43-5, 
48-51* 210. 

as a differential operation, lai. 

Diagonal matrix, 10 1. 

Dickson, 64, 296, 304. 

Differential equation satisfied by deter- 
minantal series, 123. 
satisfied by normal forms, 256-8. 
forms, 327. 
invariants, 329. 

Differentiation of a determinant, 123. 

Dimensions of a group, 161. 

Discriminant, 129, 131, 140, 191, 192, 218. 
of conic, 287, 298, 322. 

Distributive law, 57, 6i. 

Division law. 57, 64. 

Double binary forms, 138, 243.
convolution for quadrics, 193. 
suffix notation, 63. 

Duality, principle of, 262, 282, 284, 298, 301. 
and determinants, 51, 92. 
formal, 54. 

Elimination leads to invariants, 274. 

Elliott, 232, 246. 

Elliptic geometry, 315. 

Equations, linear, solution of, 10, 75, 84, 195. 

Equivalence problem. Z77. 

Equivalent forms, 277. 
symbols, 179, 1 91, 201. 

Euclidean geometry, 3x5, 316. 

Euler, III, 157, 256. 

Extensionals, 42, 49, 287. 

Extrinsic terms, 116. 

Factor. See Symbols. 

Ferrar, 55. 

Field. See Binary, Ternary. 
currency, 37, 54. 
number, 9. 

Finiteness theorem, 233-40. 

Fore and after factors, 64. 

Form, 30, 133, 168, 176. 

Formal dual, 54. 

Forsyth, 233. 


Frame of reference, 293. 

Franke, 108. 

Frobenius, 64, 128, 258, 330.

Fundamental theorem, first, 182.
second, 214, 225, 314.
affine, 312. 

for general forms, 190, 203, 208. 
for linear forms, 182, 187. 

Hermitian, 322. 

Gauss, 128. 

Gauss plane, 285. 

General forms, 18 1 . 

Generator of quadric, 299. 

Geometry, 280-96. See Cartesian, Co- 
ordinate. 

Gilliam, 223, 307. 

Gordan, 172, 193, 214, 234, 238, 242, 243, 
247. 255. 258, 307, 314. 

Gordan’s theorem, 233, 261. 
proof of, 238. 

Gordan-Capelli series, 253, 254, 255, 270, 
^ 272, 313- 

Grace, 273, 274. 278. 

Grace and Young, 114, 234, 238, 242, 246,
253, 2O1, 284, 307.

Gradient, 134, 231. 

expressed as sum of coefficients of co- 
variants, 270. 

Gram, 270-4. 

Grassmann, 58. 

Ground forms, 172. 

Group, defined, 160. 
affine, 161, 309. 

general projective, 161.

Hermitian, 320.
homogeneous, 309. 
isobaric, 310. 

Lorenz, 322. 
orthogonal, 160, 322. 

Group property, 283. 

Hamilton, 58, 99. 

Harmonic range, 283. 

invariant of quadratic, 316. 
extensional of, 287. 

Hermite, 150, 320. 

Hessian, 181, 222, 231, 233. 

identical vanishing of, 274.

Hilbert, 172, 234, 235, 240, 261, 314.
Hölder, 57.

Homogeneity, 30, 280. See Co-ordinates. 
of invariants, 171. 

Homographic ranges and transformations, 
291. 

Hyperbolic geometry, 315. 
Hyperdeterminants, 132. 

Identical transformation, 161, 270. 
Identities, fundamental, 44.
binary, 213, 278. 
dual, 93. 

general, 214, 278. 

Laplace, 41-56, 93. 

Sylvester, 45, 94-7.

Improper orthogonal group, 322. 

Index law, 68. 

Induced transformation, 135. 

Inner product, 62, 82, 83, 316. 
compound, 83. 

Integration of a rational function, 28. 
Intrinsic terms, 116. 

Invariant, defined, 138, 169. 
as elimination result, 275.
as solution of differential equation, 230-* 
equations, 271. 
factors, 304.

of binary forms, 128, 244. 
of general forms, 184, 189. 
of multiple fields, 249. 





Invariant of affine group, 309.
of orthogonal group, 130, 316, 320. 
process, 141.

projective, 129, 139, 169. 

Inverse transformation, 135. 

Inversion, 243. 

Involution, 219, 284, 292. 

Irreducible systems of forms, 233. See 
Complete Systems, 

Isobaric. See Weight. 

Jacobi, ratio theorem of, 77, 89, 108. 
lemma of, 125.

Jacobian, 124-7, 181, 231, 242, 327.
and canonical forms, 268, 302.
is a covariant, 143, 182. 
of a Jacobian is reducible, 223. 
of binary forms, 143, 181, 219, 221, 284. 
of two quadratics is harmonic to both, 284. 
product of two, 223, 307. 
rank of, 126. 
vanishing of, 126. 

Jessop, 304. 

Jordan’s lemma, 279.

Kasner, 243. 

Klein, 224. 

Kronecker, 33. 

Lagrange, 128, 229. 

Laplace’s development of a determinant, 22,
83, 323.

Lasker, 267. 

Latent points, 292, 296, 315. 
primes, 314. 
quadric, 315. 
roots, 98, 101, 292. 

Leading coefficient, 226. 

diagonal, expansion by, 98. 

Lehnen, 243. 

Levi-Civita, 328. 

Line co-ordinates, 85, 86, 285. 
geometry, 85. 

Linear dependence, 8, 50, 73. 
equations, 6, 10, 285. 
forms, 31. 

forms, invariants of, 145. 

Linearity, 30. 

Macmahon, 119.

Matrix, defined, 2.
commutative, 71, 332. 
diagonal, 101.

Jacobian, 327. 
null, 5.

orthogonal, 152, 153-7, 331. 
reciprocal, 68.
scalar, 71. 

singular, 70.
unit, 68. 

Matrix properties, 34, 59, 70. 
and quaternions, 166. 
canonical form of, 296. 
function of, 71. 

geometrical interpretation of, 291, 295. 
transformation, 149. 

Mertens, 258. 

Metrical properties, 324. 

Meyer, 234, 242, 308. 

Minor determinant, 21, 26. 

Mixed concomitant, 206. 

Modulus of transformation, 169, 311. 

Muir, 29, 46, 87. 

Multilinear forms, 197-201. 

invariants, 142, 261. 

Multiple fields, 240. 

Multiplication of matrices, 59-62. 
scalar, 61. 
fore and after, 64. 


Napier, 331.

National independence, 234. 

Net, 260. 

Noether, 247, 258, 261.
Non-commutative, 59, 119.

Norm, 1^6.

Norm curve, 285. 

Normal form, 255. 

Null matrix, 5, 61. 
system, 295. 

Operator. See Aronhold, Capelli, Cayley.
differential, 227. 

Order of matrix, 1, 7, 17.
of polynomial, 30, 133. 

Orthogonal. See Group, Matrix.
invariants, 315.

Outer product, 183, 316.

Pappus, 333. 

Parallel property, 310. 

Partial fractions, 28. 

Pascal, 214. 

Peano, 243, 260.

theorem of, 261, 314. 

Pencil, 218, 260. 

Permanent, 14, 251. 

Permutation, 13. 
determinantal, 27. 

Perpetuant, 246. 
double binary, 243. 

Picquet, 109. 

Platonic solid, 224. 

Polar, adjacent terms of, 251. 
forms, 37. 
prime, 287, 294. 
reciprocation, 295. 
symbolical expression for, 177. 

Polarization, 110.
an invariant process, 207, 250. 

Polynomial function of matrix, 71. 

Prepared system, 329. 

Prime, 86, 163, 282. 

Product of matrices, 61, 63, 71. 
inner, 62. 
outer, 183. 

Projection, 290. 

Projective property, 271, 290. 

determined by invariant equation, 272. 

Proper orthogonal group, 316. 

Pythagoras, 325. 

q-numbers, 59.

Quadratic. See Complete system, Conic.
as determinant, 105. 
latent in transformation, 158, 315. 
reduction, 194. 

Quantic, 133. 

Quartic, 245. 

Quaternary variables, 262. 

Quaternion, 166. 

Rank of matrix, 5, 73, 75, 84.
of quadric, 299. 

Rational curve, 285.

Rationality, 10, 12, 157, 233, 320.

Reciprocal matrix, 68. 

Reciprocation, 16, 283, 294. 

Reducibility, 215. 

Reiss, 108, 109. 

Relative invariant, 129, 277. 

Relativity, 201, 233, 326. 

Resolution, 208. 

Restricted transformation, 227. 

Resultant, 275. 

Reversal law, 68. 

Riemannian geometry, 315, 329. 

Rigid displacement, 131, 155.

Rodrigues, 157.

Rothe, 29. 




Saddler, 243.

Salmon, 132, 302.

Scalar instance of symbols, 175, 177.

Scalar matrix, 71. 

Schwarts, 243. 

Self-conjugate simplex, 299, 304. 

triangle, 299. 

Seminvariant, 226. 

Similar forms, 259. 

dual forms, 262. 

Simplex, 84, 293, 299, 304. 

Singular matrix, 70. 

point, apolarity theory of, 265. 

Skew symmetry, 105. 
of determinants, 105-7. 
of matrices, 36. 
of null system, 295. 

Smith, 33. 

Space of n — I dimensions, 282. 

Standard forms, 250. 

Straight line, geometry on, 283. 

Stroh’s lemma, 278. 

Study, 214, 243, 258, 325.

Subgroup, 161, 309.

Substitution, 128. 

Substitutional analysis, 119, 330.

Summary of matrix laws, 71. 

theorems on compound determinants, 108. 
Sylvester, 46, 55, 56, 87, 108, 121, 132, 133,
165, 227, 234, 250.

Symbols, defined, 173, 175, 198. 
contragredient, 200. 

effect of linear transformation on, 183, 201.
Symbolic factor types, 182.

Symbolic linear equation, solution of, 196. 
Symmetric function of roots, 134. 

matrix, 36. 

Syzygy, 224, 231.
cubic, 244. 

finiteness of system of, 239. 
for quadratics, 220. 
quartic, 246. 


Tangential equation, 285. 

Tensor, 89, 90, 200, 329. 

Transference principle. See Clebsch. 

Transformations, linear, 59, 128, 168. 
defined, 147. 
form a group, 160.
general functional, 151, 326. 
induced, 135-7, 148. 

See Group. 

Transposed matrix, 5, 36, 71. 

Transposition, 70. 

Transposition properties of determinants, 38.

Transvectants, 221.

Turnbull, 46, 55, 243, 307.

Types, 217.

Unit determinant, 32. 
matrix, 61, 68. 

Upper suffix notation, 77, 89, 200. 

Vaidyanathaswamy, 243. 

Valency condition, 186, 190, 191, 231.

Van der Waerden, 225. 

Variables, dual, 90. 
compound, 86. 

Veblen, 328. 

Vector, point, 36, 84, 86, 91.
of orthogonal group, 317. 
prime, 36, 84, 86, 91. 
properties of, 59. 

Von Gall, 246.

Wakeford, 267. 

Weight, 134, 170, 310. 
homogeneity of, 171. 

Weitzenböck, 46, 214, 225, 239, 240, 258,
261, 308, 309, 315, 325, 328.

Whittaker, 55, 56. 

Williamson, 307. 

Young, 247, 258, 330. See Grace and Young.






